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Description 

BACKGROUND OF THE INVENTION 
5 Field of the Invention 

The present invention concerns nucleic acid sequences which code for carcinoembryonic antigen 
(CEA) antigen family peptide sequences. 

w Background Information 

Carcinoembryonic antigen was first described by Gold and Freedman, J. Exp. Med., 121, 439-462, 
(1965). CEA is characterized as a glycoprotein of approximately 200,000 molecular weight with 50-60% by 
weight of carbohydrate. CEA is present during normal human fetal development, but only in very low 
75 concentration in the normal adult intestinal tract. It is produced and secreted by a number of different 
tumors. 

CEA is a clinically useful tumor marker for the management of colorectal cancer patients. CEA can be 
measured using sensitive immunoassay methods. When presurgical serum levels of CEA are elevated, a 
postsurgical drop in serum CEA to the normal range typically indicates successful resection of the tumor. 

20 Postsurgical CEA levels that do not return to normal often indicate incomplete resection of the tumor or the 
presence of additional tumor sites in the patient. After returning to normal levels, subsequent rapid rises in 
serum CEA levels usually indicate the presence of metastages. Slower postsurgical rises from the normal 
level are most often interpreted to indicate the presence of new primary tumors not previously detected. 
Post surgical management of colon cancer patients is thus facilitated by the measurement of CEA. 

25 CEA is a member of an antigen family. Because of this, the immunoassay of CEA by presently 
available methods is complicated by the fact that CEA is but one of several potentially reactive antigens. 
There have been at least sixteen CEA-like antigens described in the literature. Since some of these appear 
to be the same antigen described by different investigators, the actual number of different antigens is 
somewhat less than this number. Nonetheless, there is a complex array of cross-reactive antigens which 

30 can potentially interfere with an immunoassay of the CEA released by tumors. It is known that serum levels 
of CEA-like antigens are elevated in many non-cancerous conditions such an inflammatory liver diseases 
and also in smokers. It is important that immunoassays used for the monitoring of cancer patient status not 
be interfered with by these other CEA-like antigens. Conversely, it is important to be able to distinguish the 
antigens by immunoassays because of the possibility that different tumor types may preferentially express 

35 different forms of CEA. If so, then the ability to reliably measure the different forms of CEA can provide the 
means to diagnose or more successfully treat different forms of cancer. 

The members of the "CEA family" share some antigenic determinants. These common epitopes are not 
useful in distinguishing the members of the antigen family and antibodies recognizing them are of little use 
for measuring tumor-specific CEA levels. 

40 U.S.P. 3,663,684, entitled "Carcinoembryonic Antigen and Diagnostic Method Using Radioactive Io- 
dine", concerns purification and radioiodination of CEA for use in a RIA. 

U.S.P. 3,697,638 describes that CEA is a mixture of antigens (components A and B in this case). U.S.P. 
3,697,638 mentions methods for separating and radioiodinating each component and their use in specific 
RIA's. 

45 U.S.P. 3,852,415, entitled "Compositions for Use in Radioimmunoassay, as Substitute for Blood Plasma 
Extract in Determination of Carcinoembryonic Antigen" relates to the use of a buffer containing EDTA and 
bovine serum albumin as a substitute for plasma as a diluent for CEA RIA's. 

U.S.P. 3,867,363, entitled "Carcinoembryonic Antigens", is directed to the isolation of CEA components 
A and B, their labelling and use in a RIA. 
so U.S.P. 3,927,193, entitled "Localization of Tumors by Radiolabeled Antibodies", concerns the use of 
radiolabeled anti-CEA antibodies in whole body tumor imaging. 

U.S.P. 3,956,258, entitled "Carcinoembryonic Antigens", relates to the isolation of CEA components A 
and B. 

U.S.P. 4,086,217, entitled "Carcinoembryonic Antigens", is directed to the isolation of CEA components 
55 A and B. 

U.S.P. 4,140,753, entitled "Diagnostic Method and Reagent", concerns the purification of a CEA isomer 
called CEA-S1 and its use in a RIA. 

U.S.P. 4,145,336, entitled "Carcinoembryonic Antigen Isomer", relates to the antigen CEA-S1. 
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U.S.P. 4,180,499, entitled "Carcinoembryonic Antigens", describes a process for producing CEA 
component B. 

U.S.P. 4,228,236, entitled "Process of Producing Carcinoembryonic Antigen", is directed to the use of 
the established cell lines LS-174T and LS-180 or clones or derivatives thereof for the production of CEA. 
5 U.S.P. 4,272,504, entitled "Antibody Adsorbed Support Method for Carcinoembryonic Antigen Assay", 
concerns two concepts for the radioimmunoassay of CEA. First, U.S.P. 4,272,504 relates to a sample 
pretreatment in the form of heating to 65 to 85° C at pH 5 to precipitate and eliminate extraneous protein. 
Second, it describes the use of a solid phase antibody (either on beads or tubes) as a means to capture 
analyte and radiolabeled CEA tracer. 
70 U.S.P. 4,299,815, entitled "Carcinoembryonic Antigen Determination", concerns diluting a CEA sample 
with water and pretreating by heating to a temperature below which precipitation of protein will occur. The 
pretreated sample is then immunoassayed using RIA, EIA, FIA or chemiluminescent immunoassay. 

U.S.P. 4,349,528, entitled "Monoclonal Hybridoma Antibody Specific for High Molecular Weight Car- 
cinoembryonic Antigen", is directed to a monoclonal antibody reacting with 180 kD CEA, but not with other 
75 molecular weight forms. 

U.S.P. 4,467,031, entitled "Enzyme-lmmunoassay for Carcinoembryonic Antigen", relates to a sandwich 
enzyme immunoassay for CEA in which the first of two anti-CEA monoclonal antibodies is attached to a 
solid phase and the second monoclonal is conjugated with peroxidase. 

U.S.P. 4,489,167, entitled "Methods and Compositions for Cancer Detection", describes that CEA 
20 shares an antigenic determinant with alpha-acid glycoprotein (AG), which is a normal component of human 
serum. The method described therein concerns a solid-phase sandwich enzyme immunoassay using as one 
antibody an antibody recognizing AG and another antibody recognizing CEA, but not AG. 

U.S.P. 4,578,349, entitled "Immunoassay for Carcinoembryonic Antigen (CEA)", is directed to the use of 
high salt containing buffers as diluents in CEA immunoassays. 
25 EP 113072-A, entitled "Assaying Blood Sample for Carcinoembryonic Antigen - After Removal of 
Interfering Materials by Incubation with Silica Gel", relates to the removal from a serum of a plasma sample 
of interfering substances by pretreatment with silica gel. The precleared sample is then subjected to an 
immunoassay. 

EP 102008-A, entitled "Cancer Diagnostics Carcinoembryonic Antigen - Produced from Perchloric Acid 
30 Extracts Without Electrophoresis", relates to a procedure for the preparation of CEA from perchloric acid 
extracts, without the use of an electrophoresis step. 

EP 92223-A, entitled "Determination of Carcinoembryonic Antiyen in Cytosol or Tissue - for Therapy 
Control and Early Recognition of Regression", concerns an immunoassay of CEA, not in serum or plasma, 
but in the cytosol fraction of the tumor tissue itself. 
35 EP 83103759.6, entitled "Cytosole-CEA-Measurement as Predictive Test in Carcinoma, Particularly 
Mammacarcinoma", is similar to EP 92223-A. 

EP 83303759, entitled "Monoclonal Antibodies Specific to Carcinoembryonic Antigen", relates to the 
production of "CEA specific" monoclonal antibodies and their use in immunoassays. 

WO 84/02983, entitled "Specific CEA-Family Antigens, Antibodies Specific Thereto and Their Methods 
40 of Use", is directed to the use of monoclonal antibodies to CEA-meconium (MA)-, and NCA-specific 
epitopes in immunoassays designed to selectively measure each of these individual components in a 
sample. 

All of the heretofore CEA assays utilize either monoclonal or polyclonal antibodies which are generated 
by immunizing animals with the intact antigen of choice. None of them address the idea of making 
45 sequence specific antibodies for the detection of a unique primary sequence of the various antigens. They 
do not cover the use of any primary amino acid sequence for the production of antibodies to synthetic 
peptides or fragments of the natural product. They do not include the concept of using primary amino acid 
sequences to distinguish the CEA family members. None of them covers the use of DNA or RNA clones for 
isolating the genes with which to determine the primary sequence. 

50 
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. DEFINITIONS 
Nucleic Acid Abbreviations 

A adenine 

G guanine 

C cytosine 

T thymidine 

U uracil 

Amino Acid Abbreviations : 



Asp 


aspartic acid 


Asn 


asparagine 


inr 


threonine 


C A IT 


serine 




glutamic acid 


Gin 


glutamine 


Pro 


proline 


Gly 


glycine 


Ala 


alanine 


Cys 


cysteine 


Val 


valine 


Met 


methionine 


He 


isoleucine 


Leu 


leucine 


Tyr 


tyrosine 


Phe 


phenylalanine 


Trp 


tryptophan 


Lys 


lysine 


His 


histidine 


Arg 


arginine 



Nucleotide - A monomeric unit of DNA or RNA containing a sugar moiety (pentose), a phosphate, and a 
nitrogenous heterocyclic base. The base is linked to the sugar moiety via the glycosidic carbon (V carbon 
of the pentose) and that combination of base and sugar is called a nucleoside. The base characterizes the 
nucleotide. The four DNA bases are adenine ("A"), guanine ("G"), cytosine ("C"), and thymine ( n T"). The 
four RNA bases are A, G, C and uracil ("IT). 

DNA Sequence - A linear array of nucleotides connected one to the other by phosphodiester bonds 
between the 3' and 5' carbons of adjacent pentoses. 

Functional equivalents - It is well known in the art that in a DNA sequence some nucleotides can be 
replaced without having an influence on the sequence of the expression product. With respect to the 
peptide this term means that one or more amino acids which have no function in a particular use can be 
deleted or replaced by another one. 

Codon - A DNA sequence of three nucleotides (a triplet) which encodes through mRNA an amino acid, 
a translation start signal or a translation termination signal. For example, the nucleotide triplets TTA, TTG, 
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CTT, CTC, CTA and CTG encode the amino acid leucine ("Leu"), TAG, TAA and TGA are translation stop 
signals and ATG is a translation start signal. 

Reading Frame - The grouping of codons during translation of mRNA into amino acid sequences. 
During translation, the proper reading frame must be maintained. For example, the sequence 
5 GCTGGTTGTAAG may be translated in three reading frames or phases, each of which affords a different 
amino acid sequence 

GCT GGT TGT AAG - Ala-Gly-Cys-Lys 
w G CTG GTT GTA AG - Leu-Val-Val 

GC TGG TTG TAA G - Trp-Leu- (STOP) . 



Polypeptide - A linear array of amino acids connected one to the other by peptide bonds between the 
75 alpha-amino and carboxy groups of adjacent amino acids. 

Genome - The entire DNA of a cell or a virus. It includes inter alia the structural genes coding for the 
polypeptides of the cell or virus, as well as its operator, promoter~and ribosome binding and interaction 
sequences, including sequences such as the Shine-Dalgarno sequences. 

Structural Gene - A DNA sequence which encodes through its template or messenger RNA ("mRNA") a 
20 sequence of amino acids characteristic of a specific polypeptide. 

Transcription - The process of producing mRNA from a structural gene. 
Translation - The process of producing a polypeptide from mRNA. 

Expression - The process undergone by a structural gene to produce a polypeptide. It is a combination 

of transcription and translation. 
25 Plasmid - A non-chromosomal double-stranded DNA sequence comprising an intact "replicon" such 

that the plasmid is replicated in a host cell. When the plasmid is placed within a unicellular organism, the 

characteristics of that organism may be changed or transformed as a result of the DNA of the plasmid. For 

example, a plasmid carrying the gene for tetracycline resistance (Tet R ) transforms a cell previously sensitive 

to tetracycline into one which is resistant to it. A cell transformed by a plasmid is called a "transformant". 
30 Phage or Bacteriophage - Bacterial virus, many of which consist of DNA sequences encapsulated in a 

protein envelope or coat ("capsid protein"). 

Cloning Vehicle - A plasmid, phage DNA or other DNA sequence which is capable of replicating in a 

host cell, which is characterized by one or a small number of endonuclease recognition sites at which such 

DNA sequences may be cut in a determinable fashion without attendant loss of an essentia! biological 
35 function of the DNA, e.g., replication, production of coat proteins or loss of promoter or binding sites, and 

which contains a marker suitable for use in the identification of transformed cells, e.g., tetracycline 

resistance or ampicillin resistance. A cloning vehicle is often called a vector. 

Cloning - The process of obtaining a population of organisms or DNA sequences derived from one such 

organism or sequence by asexual reproduction. 
40 Recombinant DNA Molecule or Hybrid DNA - A molecule consisting of segments of DNA from different 

genomes which have been joined end-to-end outside of living cells and have the capacity to infect some 

host cell and be maintained therein. 

cDNA Expression Vector - A procaroytic cloning vehicle which also contains sequences of nucleotides 

that facilitate expression of cDNA sequences in eucaroytic cells. These nucleotides include sequences that 
45 function as eucaryotic promoter, alternative splice sites and polyadenylation signals. 

Transformation/Transfection - DNA or RNA is introduced into cells in such a way as to allow gene 

expression. "Infected" referred to herein concerns the introduction of RNA or DNA by a viral vector into the 

host. 

"Injected" referrred to herein concerns the microinjection (use of a small syringe) of DNA into a cell, 
so CEA antigen family (CEA gene family) - a set of genes (gene family) and their products (antigen 
family) that share nucleotide sequences homologous to partial cDNA LV-7 (CEA-(a)) and as a result of 
theses similarities also share a subset of their antigenic epitopes. Examples of the CEA antigen family 
include CEA ( = CEA-(b)), transmembrane CEA (TMCEA) = CEA-(c) and normal crossreacting antigen NCA 
( = CEA-(d)). 

55 
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SUMMARY OF THE INVENTION 

The present invention concerns the following DNA sequences designated as TM-2 (CEA-(e)), TM-3 
(CEA-(f)), TM-4 (CEA-(g)), KGCEA1 and KGCEA2, which code for CEA antigen family peptide sequences: 



SEQUENCE AND TRANSLATION OF CDNA OF TM-2 

10 30 50 

• • ■ ■ « . 

CAGCCCTGCTCGAAGCGTTCCTGGAGCCCAAGCTCTCCTCCACAGCTGAACACAGCGCCA 

70 90 HO 

CCAGGAGACACCAT GGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
Me tClyHisLeuSerAlaProLeuHisArgValArgValProTrpGin 

130 150 170 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuLeuThrAlaSerLeuLeuThrPheTrpAsnProProThrThcAlaGlnLeu 

190 210 230 

ACTACTGAATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
Th rTh cG 1 uS e r Me tProPheAsnValAlaGluGlyLysGluValLeuLeuLeuValHi s 

250 270 290 

AATCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGCAAC 
AsnLeuProGlnGlnLeuPheGlyTyrSerTrpTyrLysGlyGluAcgValAspGlyAsn 

310 330 350 

CGTCAAATTGTAGGATATGCAATAGGAACTCAACAAGCTACCCCAGGGCCCGCAAACAGC 
ArgGlnlleValGlyTyrAlalleGlyThrGlnGlnAlaThrProGlyProAlaAsnSer 

370 390 410 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACGTCACCCAGAATGAC 
GlyArgGluThrlleTyrProAsnAlaSe rLeuLeuIleGlnAsnValThcGlnAsnAsp 

<30 450 470 

ACAGGATTCTA-t ACCCTACAAGTCATAmAAGTCAGATCTTGTGAATG AAGAACC AACrGGA 
ThrGlyPheTy cTh r LeuGlnVaH 1 eLy sSe c As pLe uVa lAs nG 1 uGl uAl aTh rGly 
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/ 

I 



490 510 530 

CAGTTCCATGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
GlnPheHi sValTy rProGluLeuProLysProSe rlleSerSe rAsnAsnSerAsnPr o 



550 570 590 

GTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
ValGluAspLysAspAlaValAlaPheThrCysGluProGluThrClnAspThrThrTy r 



610 630 650 

CTGTGGTGGATAAA'CAATCAGAGCCTCCCGGTCAGTCCCAGGCTCCAGCTGTCCAATGGC 
LeuTrpTrpI leAsnAsnGlnSe r LeuProValSe rProArgLeuGlnLeuSe rAsnGly 



670 690 710 

AACACGACCCTCACTC7ACTCACTGTCACAAGGAATGACACAGGACCCTATGAGTGTGAA 
AsnArgThrLeuThrLeuLeuSecValThrArgAsnAspThrGlyProTy rGluCy sGlu 



730 750 770 

* 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCACTCACCTTCAATGTCACCTATGCC 
IleGlnAsnPcoValSe rAlaAsnAcgSerAspPcoValThrLeuAsnValThcTy rGly 



790 810 830 

CCGCACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCACGGGCAAACCTCAGC 
ProAspThrProThrlleSerProSerAspThrTyrTycArgProGlyAlaAsnLeuSer 

850 870 890 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTCGCTTATCAATGGAACA 
LeuSecCysTyrAlaAlaSerAsnProProAlaClnTycSe cTrpLeu! 1 eAsnGlyTh r 



910 930 950 

* • • • « . 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTCGATCC 
PheGlnGlnSerThrGlnGluLeuPhelleProAsnlleThrValAsnAsnSerGlySe r 



970 990 1010 

* • • • • 

TATACCTCCtACGCCAATAACTCAGTCACTCGCTGCAACAGGACCACAGTCAACACCATC 
Ty rThrCysHisAlaAsnAsnSerValThrGlyCysAsnAcgThrThrValLysThr He 



5 
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1030 10S0 1070 

ATAGTCACTGATAATCCTCTACCACAAGAAAATGGCCTCTCACCTCGGGCCATTGCTGCC 

IleValThrAspAsnAlaLeuProGlnGluAsnGlyLeuSerProGlyAlalleAlaGly 

1090 1110 1130 

ATTGTGATTGCAGTAGTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTG 
w IleVallleGlyValValAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeu 

1150 1170 1190 

• • • a • . 

7g CATTTCCGGAAGACCGGCAGGCCAACCGACCAGCGTCATCTCACAGACCACAAACCCTCA 
Hi s Ph eG ly Ly sTh cGl yArgAlaSerAspGlnArgAspLeuTh CGI uHisLysP rose r 

1210 1230 1250 

20 GTCTCCAACCACACTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGAAGTTACT 
ValSe rAsnaisThrGlnAspHi sSe TAsnAspProProAsnLysMe tAsnGluValTh r 

1270 1290 1310 

25 TATTCTACCCTGAACTTTGAAGCCCAGCAACCCACACAACCAACTTCAGCCTCCCCATCC 
TyrS*^ rThrLeuAsnPheGluAlaGlnGlnProThrGlnProThrSe rAlaSerProSec 

1330 1350 1370 

30 ...... 

CTAACAGCCACAGAAATAATTTATTCAGAAGTAAAAAAGCAGTAATGAAACCTGTCCTGC 
LeuThrAlaThrGluIlel leTyrSerGluValLysLysGln 

1390 1410 1430 

35 

TCACTGCAGTGCTGATGTATTTCAAGTCTCTCACCCTCATCACTAGGAGATTCCTTTCCC 
1450 1470 1490 

40 

CTGTAGGGTAGAGGGGTGGGGACAGAAACAACTTTCTCCTACTCTTCCTTCCTAATAGGC 

1510 1530 1550 

45 ATCTCCAGGCTGCCTGGTCACTGCCCCT CTCTCAGTGTCAATAGATGAAAGTACATTGGG 

1570 1590 1610 

AGTCTGTAGGAAACCCAACCTTCTTGTCATTGAAATTTGGCAAAGCTGACTTTCGGAAAG 

50 
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1630 1650 1670 

ACGGACCACAACTTCCCCTCCCTTCCCCTTTTCCCAACCTCGACTTGTTTT/^ACTTGCC 

1690 1710 1730 

TGTTCAGAGCACTCATTCCTTCCCACCCCCAGTCCTGTCCTATCACTCTAATTCCGATTT 

1750 1770 1790 

GCCATAGCCTTGACGTTATGTCCTTTTCCATTAAGTACATGTGCCAGGAAACAGCGAGAC 

18l ° 183J0 . 1850 

AGAGAAAGTAAACGGCAGTAATGCTTCTCCTATTTCTCCAAAGCCTTGTGTGAACTAGCA 

1870 1890 igio 

AAGAGAAGAAAATCAAATATATAACCAATAGTGAAATGCCACAGGTTTGTCCACTGTCAG 

1930 1950 1970 

GGTTGTCTACCTGTAGGATCAGGGTCTAAGCACCTTGGTGCTTAGCTAGAATACCACCTA 

1990 2010 2030 

ATCCTTCTGGCAAGCCTGTCTTCAGAGAACCCACTAGAAGCAACTACGAAAAATCACTTG 

2050 2070 2090 

CCAAAATCCAAGGCAATTCCTGATGGAAAATGCAAAAGCACATATATGTTTTAATATCTT 

2H0 2130 2150 

TATCGGCTCTGTTCAAGGCAGTGCTGAGAGGGAGGGGTTATAGCTTCAGGAGGGAACCAG 

2170 2190 2210 

CTTCTGATAAACACAATCTGCTAGGAACTTGGGAAAGGAATCAGAGAGCTGCCCTTCACC 
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2230 2250 2270 

• * • . , . 

GATTATTTAAATTCTTAAAGAATACACAATTTGGGGTATTCGGATTTTTCTCCTTTTCTC 
2290 2310 2330 

• . . ■ . 

TGAGACATTCCACCATTTTAATTTTTGTAACTGCTTATTTATGTGAAAAGGGTTATTTTT 

2350 2370 2390 

ACTTACCTTAGCTATGTCAGCCAATCCGATTGCCTTAGGTGAAAGAAACCACCGAAATCC 

75 24 10 2430 2450 

CTCAGGTCCCTTGGTCAGGAGCCTCTCAAGATTTTTTTTGTCAGAGGCTCCAAATAGAAA 



70 



20 



25 



30 



2470 2490 2510 

ATAAGAAAACGTTTTCTTCATTCATGGCTAGAGCTAGATTTAACTCACTTTCTAGGCACC 

2530 2550 2570 

TCAGACCAATCATCAACTACCATTCTATTCCATGTTTGCACCTGTGCATTTTCTGTTTGC 

2590 2610 2630 

CCCCATTCACTTTGTCAGGAAACCTTGGCCTCTGCTAAGGTGTATTTGGTCCTTGAGA-^G 



2650 2670 2690 

35 TCGGAGCACCCTACAGGGACACTATCACTCATCCTGGTGGCATTGTTTACAGCTAGAAAG 

2710 2730 2750 

40 CTGCACTGGTGCTAATGCCCCTTGGGAAATGGGGCTGTGAGGAGGAGGATTATAACTTAG 

2770 2790 2010 

GCCTACCCTCTTTTAACAGCCTCTGAAATTTATCTTTTCTTCTATGGGGTCTATAAATGT 

45 

2030 2850 2870 

/sTCTTATAJkTA^AAAGGAACGACAGGAGGAAGACAGGCAAATGTACTTCTCACCCAGTCT 

50 
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( 



2890 2910 2930 . 

TCTACACAGATGGAATCTCTTTGGCCCTAAGAGAAAGGTTTTATTCTATATTGCTTACCT 

2950 2970 2990 

GATCTCATGTTAGGCCTAAGAGGCTTTCTCCAGGAGCATTAGCTTGGAGTTCTCTATACT 

3010 3030 3050 

CAGGTACCTCTTTCAGGGTTTTCTAACCCTGACACGGACTGTGCATACTTTCCCTCATCC 

3070 3090 3110 

ATGCTGTGCTGTGTTATTTAAT TTTTCCTGGCTAAGATCATGTCTGAATTATGTATGAAA 

3130 3150 3170 

ATTATTCTATGTTTTTATAATAAAAATAATATATCAGACATCGAAAAAAAAAA 
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SEQUENCE AND TRANSLATION OF cDNA OF TM-3 

10 30 50 

10 CAGCCGTGCTCGAAGCGTTCCTGGAGCCCAAGCTCTCCTCCACAGGTGAAGACAGGGCCA 

70 90 • 110 

75 GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 

Me tGlyHi sLeuSe rAlaPcoLeuHi sAcgValAcgValP roTrpGln 

130 150 170 

20 ...... 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuLeuThrAlaSe r LeuLeuThr PheTrpAsnProProThrThrAlaGlnLeu 



190 210 230 

ACTACTGAATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSerMetProPheAsnValAlaGluGlyLysGluValLeuLeuLeuValHi s 



30 



250 270 290 

• •••** 
AATCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGCAAC 
AsnLeuProGlnGlnLeuPheGlyTy rSe rTcpTy rLysGlyGluArgValAspGlyAsn 



310 330 - - 350 

• « - . . . . 

CGTCAAATTGTAGGATATGCAATAGGAACTCAACAAGCTACCCCAGGGCCCGCAAACAGC 
ArgGlnlleValGlyTycAlalleGlyThcGlnGlnAlaThrProGlyProAlaAsnSer 



370 390 410 

♦ » • • . • 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACGTCACCCAGAATGAC 
GlyArgGluThrl leTyrProAsnAlaSerLeuLeuIleGlnAsnValThrGlnAsnAsp 



50 



55 



( 
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( 



430 450 470 

ACAGGATTCTACACCCTACAAGTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGA 

ThrGlyPheTy rThr LeuGlnVallleLysSerAspLeuValAsnGluGluAlaThrGly 



490 510 530 

CAGTTCCATGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
GlnPheHisValTy cPro'GluLeuProLysProSe rlleSe rSe rAsnAsnSe rAsnPro 



550 570 590 

GTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
ValGluAspLysAspAlaValAlaPheThrCysGluProGluThrGlnAspThrThrTyr 



610 630 650 

■ • • • > « 

CTGTGGTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
LeuTrpTcpl 1 eAsnAsnGlnSe rLeuProValSe rPr oArgLeuGlnLeuSe rAsnGly 



670 690 710 

AACAGGACCCTCACTCTACTCAGTGTCACAAGGAATGACACAGGACCCTATGAGTGTGAA 
AsnArgThrLeuThrLeuLeuSerValThrArgAsnAspThrGlyProTyrGluCysGlu 



730 750 770 

• • > • 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
IleGlnAsnProValSerAlaAsnArgSerAspProValThrLeuAsnValThrTyr.Gly 



790 810 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
ProAspThrProThrlleSerProSerAspThrTyrTyrArgProGlyAlaAsnLeuSer 
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850 870 890 

• • • ■ • • 

CTCTCCTCCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATCAATGGAACA 
LeuSe rCysTyrAlaAlaSe rAsnProProAlaGlnTyrSerTrpLeuIleAsnGlyTh r 

910 930 950 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTGGATCC 
PheGlnGlnSerThrGlnGluLeuPhelleProAsnlleThrValAsnAsnSerGlySer 

970 990 1010 

• • • • • • 
TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
Ty rThrCysHisAlaAsnAsnSe rValThrGlyCysAsnArgThrThrValLysThrl le 

1030 1050 1070 

ATAGTCACTGAGCTAAGTCCAGTAGTAGCAAAGCCCCAAATCAAAGCCAGCAAGACCACA 
IleValThrGluLeuSe rProValValAlaLysProGlnlleLysAlaSerLysThrThr 

1090 1110 1130 

GTCACAGGAGATAAGGACTCTGTGAACCTGACCTGCTCCACAAATGACACTGGAATCTCC 
ValThrGlyAspLysAspSerValAsnLeuThrCysSerThrAsnAspThrGlylleSer 

1150 1170 1190 

■ ••••« 
ATCCGTTGGTTCTTCAAAAACCAGAGTCTCCCGTCCTCGGAGAGGATGAAGCTGTCCCAG 
UeArgTrpPhePheLysAsnGlnSerLeuProSerSerGluArgMetLysLeuSerGln 

1210 1230 1250 



GGCAACACCACCCTCAGCATAAACCCTGTCAAGAGGGAGGATGCTGGGACGTATTGGTGT 
GlyAsnThrThrLeuSerlleAsnProValLysArgGluAspAlaGlyThrTyrTrpCys 
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1270 1290 1310 

• • • • 

GAGGTCTTCAACCCAATCAGTAAGAACCAAAGCGACCCCATCATGCTGAACGTAAACTAT 
5 GluValPheAsnProIleSe rLysAsnGlnSe rAspProIleMe tLeuAsnValAsnTy r 

1330 1350 1370 



10 



15 



20 



25 



30 



35 



40 



45 



50 



AATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGGCCATTGCTGGCATTGTGATTGGA 
AsnAlaLeuProGlnGltfAsnGlyLeuSerPcoGlyAlalleAlaGlylleVallleGly 

1390 1410 1430 

GTAGTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTGCATTTCGGGAAG 
valValAlaLeuValAlaLeulleAlaValAlaLeuAlaCysPheLeuHisPheGlyLys 

1450 1470 1490 

• • • • • • 
ACCGGCAGCTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGAAGTTACTTATTC 
ThrGlySe rSe rGlyProLeuGln 

1510 1530 1550 

TACCCTGAACTTTGAAGCCCAGCAACCCACACAACCAACTTCAGCCTCCCCATCCCTAAC 

1570 1590 1610 

• • • » • • 
AGCCACAGAAATAATTTATTCAGAAGTAAAAAAGCAGTAATGAAACCTGAAAAAAAAAAA 

1630 
AAAAAAAAAA 



55 
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SEQUENCE AND TRANSLATION OF cDNA OF TM-4 

10 30 5b 

C AC CCGTGCTCCAACCGTTCCTGG AG CCCAAGCTCTCCTCCACACCTGAAGACAGGGCC A 

70 .90 UQ 

GGAGGAGAG ACCATGGGGcAcCTCTCAGCCCCACTTCACAGAGT JCGTGTACCCTGGCAG 
M et G lyH^sLeuSe^AlaPcoLeuHisArgValArgValProT^pCln 



i30 - 150 



170 



GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCACCTr 
G ^euLeuLeuThcAlaSecLeuLeuThrPheTrpAsnPro^ 

190 ' 210 2 3 0 

iKT^P^t ATCCATGCCA7TC ^ vTGTTGCAGAGCGGAA GGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSerMetProPheAsnValAlaGluGlyLysGluValLeuLuL^SalHU 



250 270 290 

;g; 

:pGlyAi 



AsnLeuProGlnGlnLeuPheGlyTyrSerTrpTyrLysGlyGluArgValAspGlyAsn 



310 330 



350 



CGTCAAATTGTAGGATATGCAATACGAACTCAACAAGCTACCCCAGGGCCCGCAAACArr 

370 390 4 10 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACCTCACCCAGAATGAC 
GlyAcgGluThrlleTyrPcoAsnAlaSerLeuLeuIleGlnAsnValThrClnAsnAsp 

430 450 < 70 

^ A ^? A II CTACACCCTACAACTCATAAACTCAGAT CTTGTCAATGA^ 
ThrGlyPheTytThrLeuGlnVallleLysSerAspLeuValAsnGluG^MaThlcW 
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490 - 510 530 

CAGTTCCATGTATACCCGGAGCTCCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
GlnPheHi sValTyrPcoGluLeuProLysProSe rl leSe rSe r AsnAsnSe r as rip ro 



550 570 590 

• • • • . 

GTGGAGCACAAGGATGCTCTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
valGluAspLysAspAlaValAlaPheThrCysGluPioGluThrGlnAspThrThrTyr 

610 630 650 

CTGTGGTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
LeuTrpTrpIleAs-nAsnGlnSe cLeuProValSerProAcgLcuGlnLeuSe rAsnGly 



670 690 710 

AACAGGACCCTCACTCTACTCAGTGTCACAAGGAATGACACAGGACCCTATGAGTGTGAA 
AsnArgThr CeuThr LeuLeuSe rValThrArgAsnAspThrGlyProTy rGluCy sGlu 



730 750 770 

• * • • • 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
I leClnAsnProValSe rAlaAsnArgSerAspProValThcLeuAsnValThcTy rGly 



790 810 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
ProAspThcProThcl leSe rProSe rAspThrTy rTy rArgProGlyAlaAsnLeuSe r 

850 870 890 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATCAATGGAACA 
LeuSe cCy sTy rAlaAlaSe rAsnProProAlaGlnTyrSerTrpLeuIleAsnGlyThr 



910 930 950 

TTCCAGCAAAGCACACA^AGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTGGATCC 
PheGlnGlnSe rThcGlnGluLeuPhe I lePcoAsnl leThrVal AsnAsnSe cGlySe r 

970 990 1010 

TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
TyrThrCysHl s Ala AsnAsnSe c valTh rGlyCysAsnAcgTh cTh rVal Ly sTh r I le 
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1030 1050 1070 

ATAGTCACTGATAATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGCCCATTGCTGGC 
5 IleValThrAspAsnAlaLeuProGlnGluAsnGlyLeuSe cProGlyAlalleAlaGly 

1090 1110 1130 

• • • • • • 

10 ATTGTGATTCCAGTAGTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTG 
I leva II leClyValValAlaLeuValAlaLeuIleAlavalAlaLeuAlaCysPheLeu 

1150. 1170 1190 

75 CATTTCGGGAAGACCGGCAGCTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGA 
Hi sPheGlyLysThrGlySe rSe rGlyProLeuGln 

12-10 1230 1250 

20 ..... 

AGTTACTTATTCTACCCTGAACTTTGAAGCCCAGCAACCCACACAACCAACTTCAGCCTC 

1270 1290 1310 

25 CCCATCCCTAACAGCCACAGAAATA>.TTTATTCAGAAGTAAAAAAGCAGTAATGAA.^CCT 

1330 

~ n GAAAAAAAAAAAAAAAAAA 



The present invention is also directed to a replicable recombinant cloning vehicle ("vector") having an 
insert comprising a nucleic acid, e.g., DNA, which comprises a base sequence which codes for a CEA 
35 peptide or a base sequence hybridizable therewith. 

This invention also relates to a cell that is transformed/transfected, infected or injected with the above 
described replicable recombinant cloning vehicle or nucleic acid hybridizable with the aforementioned 
cDNA. Thus the invention also concerns the transfection of cells using free nucleic acid, without the use of 
a cloning vehicle. 

40 Still further, the present invention concerns a polypeptide expressed by the above described transfec- 
ted, infected or injected cell, which polypeptide exhibits immunological cross-reactivity with a CEA, as well 
as labelled forms of the polypeptide. The invention also relates to polypeptides having an amino acid 
sequence, i.e., synthetic peptides, or the expression product of a cell that is transfected, injected, infected 
with the above described replicable recombinant cloning vehicles, as well as labelled forms thereof. Stated 

45 otherwise, the present invention concerns a synthetic peptide having an amino acid sequence correspond- 
ing to the entire amino acid sequence or a portion thereof having no less than five amino acids of the 
aforesaid expression product. 

The invention further relates to an antibody preparation specific for the above described polypeptide. 
Another aspect of the invention concerns an immunoassay method for detecting CEA or a functional 

so equivalent thereof in a test sample comprising 

(a) contacting the sample with the above described antibody preparation, and 

(b) determining binding thereof to CEA in the sample. 

The invention also is directed to a nucleic acid hybridization method for detecting a CEA or a related 
nucleic acid (DNA or RNA) sample in a test sample comprising 
55 (a) contacting the test sample with a nucleic acid probe comprising a nucleic acid, which comprises a 
base sequence which codes for a CEA peptide sequence or a base sequence that is hybridizable 
therewith, and 

(b) determining the formation of the resultant hybridized probe. 
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The present invention also concerns a method for detecting the presence of carcinoembryonic antigen 
or a functional equivalent thereof in an animal or human patient in vivo comprising 

a) introducing into said patient a labeled (e.g., a radio-opaque material that can be detected by X-rays, 
radiolabeled or labeled with paramagnetic materials that can be detected by NMR) antibody preparation 
according to the present invention and 

b) detecting the presence of such antibody preparation in the patient by detecting the label. 

In another aspect, the present invention relates to the use of an antibody preparation according to the 
present invention for therapeutic purposes, namely, attaching to an antibody preparation radionuclides, 
toxins or other biological effectors to form a complex and introducing an effective amount of such complex 
into an animal or human patient, e.g., by injection or orally. The antibody complex would attach to CEA in a 
patient and the radionuclide, toxin or other biological effector would serve to destroy the CEA expressing 
cell. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a schematic representation of the transmembrane CEA's 

DETAILED DESCRIPTION OF THE INVENTION 

In the parent application 87111/68, published as EP-A-263 933, applicants described the following 
CEA's: 





ATCC No. 


CEA-(a) partial CEA (pcLV7) 
CEA-(b) full coding CEA (pc 15LV7) 
CEA-(c) TM-1 (FL-CEA; pc 19-22) 
CEA-(d) NCA (pcBT 20) 


67709 
67710 
67711 



In the present application, applicants described the following CEA's: 





ATTC No. 


CEA-(e) TM-2 (pc E22) 
CEA-(f) TM-3 (pc HT-6) 
CEA-(g) TM-4. 


67712 
67708 



ATCC Nos. 67708, 67709, 67710, 67711 and 67712 were all deposited with the American Type Culture 
Collection on May 25, 1988. 
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The sequences for CEA-(a), CEA-(b), CEA-(c) and CEA-(d) are given hereinbelow: 



CEA-(a) : 



70 



75 



20 



25 



30 



35 



OG OCT TTA CAC AAC CAC CAC OCC ATC AAA OOC TTC ATC AOC AOC AAC AAC TX AAC OCC GTG 
GAG GAT GAG GAT OCT GTA OOC TTA AOC TCT GAA OCT GAG ATT CAG AAC ACA AOC TAC CTG 
TGG TOG GTA AAT AAT CAG AOC CTC COG CTC ACT OOC AGG CTG CAG CTG TOC AAT GAC AAC 
AOG AOC CTC ACT CTA CTC ACT GTC ACA AOG AAT GAT GTA OGA OCC TAT GAG TGT GGA ATC 
CAC AAC GAA TTA ACT CIT GAC CAC AGC GAC OCA G1C ACC CAG OGA TIC CTC TAT OOC OCA 
GAC GAC OOC AOC ATT TOC OOC TCA TAC ACC TAT TAC OCT OCA OX GTG GAA OCT CAG CCT 
CTC TOC CAT OCA OCC TCT AAC OCA OCT OCA CAG TAT 1CT TGG CTC ATT GAT GOG ACC CIC 
CAG CAA CAC ACA CAA GAG CIC TTT ATC TOC AAC ATC ACT GAG AAC AAC AOC OGA CTC TAT 
ACC TGC CAG OCC AAT AAC TCA OCC ACT GOC ACA OCA GGA CTA CAG TCA AGA CAA TCA CAG 
TCT CTG COG ATG OX AAG COC TOC ATC TOC AOC AAC AAC TOC AAA OOC CTG GAG GAC AAG 
CAT COC TCT OOC CTT CAC TCT GAA OCT GAG OCT CAG AAC ACA AOC TAC CTG TCC TOG GTA 
AAT OCT CAG AOC CTC CCA CIC ACT OCC AOG CTG CAG CIC TOC AAT OGC AAC AOG AOC CIC 
ACT CTA TTC AAT CIC ACA AGA AAT GAC OCA AGA OOC TAT CTA TCT GGA ATC CAG .AAC TCA 
GTG ACT OCA AAC OOC ACT GAC OCA GTC ACC CIC CAT GTC CTC T*.T GOG COG GAC AOC COC 
ATC ATT TOC CCC COC CC 



40 



45 



(b) 



10 



20 



30 



40 



50 



C ACC ATG GAG TCT CCC TCG GCC CCT CTC CAC AgTtGG TGC ATC CCC TGG CAG AGG CTC 
Met Glu Ser Pro Ser Ala Pro Leu Mi, Arg Trp Cys He Pro Trp Gin Arg Leu 



50 



55 
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( 



60 ™ 80 90 100 no 

CTG CTC ACA GCC TCA CTT CTA ACC TTC TGG AAC CCG CCC ACC ACT GCC AAG CTC ACT 
5 Leu Leu Thr AU Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr Thr Ala Lys Leu Thr 

1 2 3 



120 130 140 

• • 

ATT GAA TCC ACG CCG TTC AAT CTC GCA GAG 
He Glu Ser Thr Pro Phe Asn Val Ala Glu 
4 5 6 7 8 9 10 11 12 13 



150 160 170 

■ ■ . 

GGG AAG GAG GTG CTT CTA CTT GTC CAC 
Gly Lys Glu Val Leu Leu Leu Val His 
14 15 16 17 18 19 20 21 22 



75 180 190 200 210 220 

AAT CTG CCC CAG CAT CTT TTT GGC TAC AGC TGG TAC AAA GGT GAA AGA GTG GAT GGC 

Asn Leu Pro Gin His Leu Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly 

23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 

20 

230 240 250 260 270 280 

AAC CGT CAA ATT ATA GGA TAT GTA ATA GGA ACT CAA CAA GCT ACC CCA GGG CCC GCA 
Asn Arg Gin He lie Gly Tyr Val He Gly Thr Gin Gin Ala Thr Pro Gly Pro Ala 
25 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 



290 300 310 320 330 340 

TAC ACT GGT CGA GAG ATA ATA TAC CCC AAT GCA TCC CTG CTG ATC CAG AAC ATC ATC 
Tyr Ser Gly Arg Glu He He Tyr Pro Asn Ala Ser Leu Leu He Gin Asn He lie 
61 62. 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 



350 360 370 380 390 400 

35 " ... 

CAG AAT GAC ACA GGA TTC TAC ACC CTA CAC GTC ATA AAG TCA GAT CTT GTG AAT GAA 
Gin Asn Asp Thr Gly Phe Tyr Thr Leu His Val He Lys Ser Asp Leu Val Asn Glu 
80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 



4 !0 420 430 440 450 

GAA GCA ACT GGC CAG TTC CCG GTA TAC CCG GAG CTG CCC AAG CCC TCC ATC TCC AGC 

Glu Ala Thr Gly Gin Phe Arg Val Tyr Pro Glu Leu Pro Lys Pro Ser lie Ser Ser 

99 101 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 



460 470 480 

■ • 

AAC AAC TCC AAA CCC GTG GAG GAC AAG GAT 

Asn Asn Ser Lys Pro Val Glu Asp Lys Asp 

118 119 120 121 122 123 124 125 126 127 



490 500 S10 

• • • 

GCT GTG GCC TTC ACC TGT GAA CCT GAG 
Ala Val Ala Phe Thr Cys Glu Pro Glu 
128 129 130 131 132 133 134 135 136 
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520 530 540 550 560 570 

• - • • • 

ACT CAG GAC OCA XCC TAC CTG TGG TGG GTA AAC AAT CAG AGC CTC CCG GTC AGT CCC 

5 Thr Gin Asp Ala Thr Tyr Leu Trp Trp Val Asn A$n Gin Ser Leu Pro Val Scr Pro 

137 138 139 140 141 142 143 144 14$ 146 147 148 149 150 151 152 153 154 155 



S80 590 600 610 620 

• ■ • • • 

AGG CTG CAG CTG TCC AAT GGC AAC AGG ACC CTC ACT CTA TTC AAT GTC ACA AGA AAT 
Arg Leu Gin Leu Ser Asn 61 y Asn Arg Thr Leu Thr Leu Phe Asn Val Thr Arg Asn 
156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 



75 630 640 650 660 670 680 

• - mm* « 

GAA CAA GCA AGC TAC AAA TGT GAA ACC CAG AAC CCA GTG AGT GCC AGG CGC AGT GAT 

Glu Gin Ala Ser Tyr Lys Cys Glu Thr Gin Asn Pro Val Ser AH Arg Arg Ser Asp 

175 175 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 

20 

690 700 710 720 730 740 

• « ■ « * * 

TCA GTC ATC CTG AAT GTC CTC TAT GGC CCG GAT GCC CCC ACC ATT TCC CCT CTA AAC 
Ser Val He Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro Thr He Ser Pro Leu Asn 
25 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 



750 ^ 760 770 

• • • 

ACA TCT TAC AGA TCA GGG GAA AAT CTG AAC 

Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn 

213 214 215 216 217 218 219 220 221 222 



780 790 

■ m 

CTC TCC TGC CAC GCA GCC TCT AAC CCA 
Leu Ser Cys His Ala Ala Ser Asn Pro 
223 224 225 226 227 228 229 230 231 



800 810 820 830 840 850 

mm* mm * 

CCT GCA CAG TAC TCT TGG TTT GTC AAT GGG ACT TTC CAG CAA TCC ACC CAA GAG CTC 
Pro Ala Gin Tyr Ser Trp Phe Val Asn Gly Thr Phe Gin Gin Ser Thr Gin Glu Leu 
232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 



860 870 880 890 900 910 

• • « • • • 

TTT ATC CCC AAC ATC ACT GTG AAT AAT AGT GGA TCC TAT ACG TGC CAA GCC CAT AAC 
Phe He Pro Asn He Thr Val Asn Asn Ser Gly Ser Tyr Thr Cys Gin Ala Hit Asn 
251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 



920 930 940 950 960 970 

m m m • « ■ 

TCA GAC ACT GGC CTC AAT AGG ACC ACA GTC ACG ACG ATC ACA GTC TAT GCA GAG CCA 

50 Ser Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr lie Thr Val Tyr Ala Glu Pro 

270 271 272 273 274 Z75 276 277 278 279 280 281 262 283 284 285 286 287 288 
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(' 



980 990 1000 1010 1020 

m m m m m 

CCC AAA CCC TTC ATC ACC AGC AAC AAC TCC AAC CCC GTG GAG GAT GAG GAT GCT GTA 
Pro lys Pro Phe He Thr Ser Asn Asn Ser Asn Pro Val Glu Asp Glu Asp Ala Val 
289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 



1030 1040 1050 

■ • * 

GCC TTA ACC TGT GAA CCT GAG ATT CAG AAC 
Ala Leu Thr Cys Glu Pro Glu lie Gin Asn 
308 309 310 311 312 313 314 315 316 317 



1060 1070 1080 

mm* 

ACA ACC TAC CTG TGG TGG GTA AAT AAT 
Thr Thr Tyr Leu Trp Trp Val Asn Asn 
318 319 320 321 322 323 324 325 326 



15 



1090 



1100 



1110 



1120 



1130 



1140 



20 



CAG AGC CTC CCG GTC AGT CCC AGG CTG CAG CTG TCC AAT GAC AAC AGG ACC CTC ACT 
Gin Ser Leu Pro Val Ser Pro Arg Leu Gin Leu Ser Asn Asp Asn Arg Thr Leu Thr 
327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 



1150 



1160 



1170 



1180 



1190 



25 



CTA CTC AGT GTC ACA AGG AAT GAT GTA GGA CCC TAT GAG TGT GGA ATC CAG AAC GAA 
Leu Leu Ser Val Thr Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly lie Gin Asn Glu 
346 347 348 349 350 351 352 353 354 355 356 357 3S8 359 360 361 362 363 354 



1200 1210 1220 1230 1240 1250 

• * - « ■ « 

TTA AGT GTT GAC CAC AGC GAC CCA GTC ATC CTG AAT GTC CTC TAT GGC CCA GAC GAC 
Leu Ser Val Asp His Ser Asp Pro Val He Leu Asn Val Leu Tyr Gly Pro Asp Asp 
365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 



1260 1270 1280 1290 1300 1310 

« ■ » ■ • • 

CCC ACC ATT TCC CCC TCA TAC ACC TAT TAC CGT CCA GGG GTG AAC CTC AGC CTC TCC 

Pro Thr He Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn Leu Ser Leu Ser 

384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 



1320 1330 1340 

mm m 

TGC CAT GCA GCC TCT AAC CCA CCT GCA CAG 
Cys His Ala Ala Ser Asn Pro Pro Ala Gin 
403 404 405 406 407 408 409 410 411 412 



1350 1360 
« ■ 

TAT TCT TGG CTG ATT GAT GGG AAC ATC 
Tyr Ser Trp Leu He Asp Gly Asn lie 
413 414 415 416 417 418 419 420 421 



1370 1380 1390 1400 1410 1420 

CAG CAA CAC ACA CAA GAG CTC TTT ATC TCC AAC ATC ACT GAG AAG AAC AGC GGA CTC 

Gin Gin His Thr Gin Glu Leu Phe lie Ser A$n lie Thr Glu Lys Asn Ser Gly Leu 

422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 



55 
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1430 1440 1450 

• m m 

TAT ACC TGC CAG GCC AAT AAC TCA GCC ACT 
Tyr Thr Cys Gin Ala Asn Asn Ser Ala Ser 
441 442 443 444 445.446 447 448 449 450 



1460 1470 1480 

• • • 

GGC CAC AGC AGG ACT ACA GTC AAG ACA 

Gly His Ser Arg Thr Thr Val Lys Thr 

451 452 453 454 455 456 457 458 459 







1490 
■ 


1500 
• 


10 


ATC 


ACA 


GTC TCT GCG 


GAC GTG CCC 




He 


Thr 


Val Ser Ala 


Asp Val Pro 




460 


461 


462 463 464 


465 466 467 


15 






1550 
« 


1560 
• 




GTG 


GAG 


GAC AAG GAT 


GCT GTG GCC 




Val 


Glu 


Asp Lys Asp 


Ala Val Ala 




479 


480 


481 482 483 


484 48S 486 


20 












1600 
• 


1610 
* 


1620 
■ 




TAC 


CTG 


TGG TGG GTA 


AAT GGT CAG 




Tyr 


Leu 


Trp Trp Val 


Asn Gly Gin 


25 


498 


499 


500 501 502 


503 504 505 



1510 1520 1530 1540 

• • • ■ 

AAG CCC TCC ATC TCC AGC AAC AAC TCC AAA CCC 

Lys Pro Ser lie Ser Ser Asn Asn Ser Lys Pro 

468 469 470 471 472 473 474 475 476 477 478 



1570 1580 1590 

■ a « 

TTC ACC TGT GAA CCT GAG GCT CAG AAC ACA ACC 
Phe Thr Cys Glu Pro Glu Ala Gin Asn Thr Thr 
487 488 489 490 491 492 493 494 495 496 497 



1630 1640 1650 

■ « • 

AGC CTC CCA GTC AGT CCC AGG CTG CAG CTG TCC 
Ser Leu Pro Val Ser Pro Arg Leu Gin Leu Ser 
506 507 508 509 510 511 512 513 514 SIS 516 



1660 1670 1680 

■ • • 

AAT GGC AAC AGG ACC CTC ACT CTA TTC 
Asn Gly Asn Arg Thr Leu Thr Leu Phe 
517 518 519 520 521 522 S23 S24 525 



1690 1700 1710 







■ 




■ 




■ 


AAT 


GTC 


ACA 


AGA AAT 


GAC GCA AGA 


GCC 


TAT 


Asn 


Val 


Thr 


Arg Asn 


Asp Ala Arg 


Ala 


Tyr 


526 


527 


528 


529 530 


531 532 533 


534 


535 



1720 1730 1740 1750 1 760 



35 





• 




* • 






■ 




■ 






GTA 


TGT GGA ATC 


CAG AAC 


TCA GTG AGT GCA 


AAC 


CGC 


AGT 


GAC 


CCA GTC ACC 


CTG 


GAT 


Val 


Cys Gly He 


Gin Asn 


Ser Val Ser Ala 


Asn 


Arg 


Ser 


Asp 


Pro Val Thr 


Leu 


Asp 


536 


537 538 S39 


540 541 


542 543 544 545 


546 


547 


548 


549 


550 551 552 


553 


554 



40 



1770 



1780 



1790 



1800 



1810 



1820 



GTC CTC TAT GGG CCG GAC ACC CCC ATC ATT TCC CCC CCA GAC TCG TCT TAC CTT TCG 
Val Leu Tyr Gly Pro Asp Thr Pro He lie Ser Pro Pro Asp Ser Ser Tyr Leu Ser 
555 556 557 558 5S9 560 561 S62 563 564 565 566 567 568 569 570 571 572 573 



45 



1830 



1840 



1850 



1860 



1870 



1880 



50 



GGA GCG AAC CTC AAC CTC TCC TGC CAC TCG GCC TCT AAC CCA TCC CCG CAG TAT TCT 
Gly Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gin Tyr Ser 
574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 S91 592 



55 
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1890 

m 




1900 
• 


1910 1920 




1930 




TGG 


CGT ATC AAT 


GGG 


ATA CCG GAG 


CAA CAC ACA CAA GTT CTC 


TTT 


ATC 


GCC AAA ATC 




Trp 


Arg He Asn 


Gly 


He Pro Gin 


Gin His Thr Gin Val Leu 


Phe 


He 


Ala Lys He 


5 


593 


594 595 596 


59/ 


598 599 600 

mr^m -+ J J WWW 


601 602 603 604 60S 606 

WW I WW4» VIVV Uw~ WW WWW 


607 


608 


609 610 611 




■ 


1950 

m 




1960 
« 


1970 1980 

• * 






1990 
■ 


10 


ACG 


CCA AAT AAT 


AAC 


GGG ACC TAT 


GCC TGT TTT GTC TCT AAC 


TTG 


GCT 


ACT GCC CGC 




Thr 


Pro Asn Asn 


Asn 


Gly Thr Tyr 


A1* Cys Phe Val Ser Asn 


Leu 


Ala 


Thr Gly Arg 




612 


613 614 615 


616 


617 618 619 


620 621 622 623 624 625 


626 


627 


628 629 630 



75 



20 



25 



30 



35 



40 



45 



2000 2010 2020 2030 2040 2050 





■ 




• 


• mm 










AAT 


AAT 


TCC ATA 


GTC AA6 AGC 


ATC ACA GTC TCT GCA TCT GGA ACT 


TCT 


CCT 


GGT 


CTC 


Asn 


Asn 


Ser He 


Val Lys Ser 


He Thr Val Ser Ala Ser Gly Thr 


Ser 


Pro 


Gly 


Leu 


631 


632 


633 634 


635 636 637 


638 639 640 641 642 643 644 645 


646 


647 


648 


649 




2060 
* 


2070 
• 


2080 2090 2100 

m mm 






2110 
* 


TCA 


GCT 


GGG GCC 


ACT GTC GGC 


ATC ATG ATT GGA GTG CTG GTT GGG 


GTT 


GCT 


CTG 


ATA 


Ser 


Ala 


Gly Ala 


Thr Val Gly 


lit Het He Gly Val Leu Val Gly 


Val 


Ala 


Leu 


He 


650 


651 


652 653 


654 655 656 


657 658 659 660 661 662 663 664 


665 


666 


667 


668 






2120 
• 


2130 
* 


2140 2150 

• m 


2160 






TAG 


CAG 


CCC TGG 


TGT AGT TTC 


TTC ATT TCA GGA AGA CTG ACA GTT 


GTT 


TTG 


CTT 


CTT 



2170 2180 2190 2200 2210 2220 

" • m m mm 

CCT TAA AGC ATT TGC AAC AGC TAC AGT CTA AAA TTG CTT CTT TAC CAA GGA TAT TTA 
2230 2240 2250 2260 2270 2280 

■ ■ • m mm 

CAG AAA ATA CTC TGA CCA GAG ATC GAG ACC ATC CTA GCC AAC ATC GTG AAA CCC CAT 
2290 2300 2310 2320 2330 

■ • » • m 

CTC TAC TAA AAA TAC-AM AAT GAG CTG GGC TTG GTG GCG CGC ACC TGT AGT CCC AGT 
2340 2350- 2360 2370 2380 2390 

« * • ■ • m 

TAC TCG GGA GGC TGA GGC AGG AGA ATC GCT TGA ACC CGG GAG GTG GAG ATT GCA GTG 



50 



55 
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2400 2410 2420 2430 2440 2450 

• ■ m m m « 

AGC CCA GAT CGC ACC ACT GCA CTC GAG TCT GGC AAC AGA GCA AGA CTC CAT CTC AAA 



70 



75 



20 



2460 2470 2480 2490 2500 

• • m m m 

AAG AAA AGA AAA GAA GAC TCT GAC CTG TAC TCT TGA ATA" CAA GTT TCT GAT ACC ACT 

2510 2520 2530 2540 2550 2560 

■ • ■ • ■ • • 

GCA CTG TCT GAG AAT TTC CAA AAC TTT AAT GAA CTA ACT GAC AGC TTC ATG AAA CTG 

2570 2580 2590 2600 2610 2620 

• • ■ • « m 

TCC ACC AAG ATC AAG CAG AGA AAA TAA TTA ATT TCA TGG GGA CTA AAT GAA CTA ATG 

2630 2640 2650 2660 2670 2680 

■ • • m m m 

AGG ATA ATA TTT TCA TAA TTT TTT ATT TGA AAT TTT GCT GAT TCT TTA AAT GTC TTG 



25 



30 



2690 2700 2710 2720 2730 

• • ■ mm 

TTT CCC AGA TTT CAG GAA ACT TTT TTT CTT TTA AGC TAT CCA CTC TTA CAG CAA TTT 

2740 2750 2760 2770 2780 2790 

• m m m mm 

GAT AAA ATA TAC TTT TGT GAA CAA AAA TTG AGA CAT TTA CAT TTT ATC CCT ATG TGG 



35 



2800 2810 2620 2830 

* m m m 

TCG CTC CAG ACT TGG GAA ACT ATT CAT GAA TAT TTA TAT TGT ATG 



40 



45 



50 



55 
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CEA-(c): 

10 30 50 

CAGCCGTCCTCGAAGCGTTCCTGGAGCCCAAGCTCTCCTCCACAGGTGAAGACAGGGCCA 

70 90 110 

GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
MetGlyHisLeuSecAlaProLeuHisArgValArgValProTrpGln 

130 150 170 

• • • • • • 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuLeuThrAlaSe TLeuLeuThrPheTrpAsnProProThrThrAlaGlnLeu 

190 210 230 

ACTACTGAATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSerMetProPheAsnValAlaGluGlyLysGluValLeuLeuLeuValHis 

250 270 290 

• • • • ■ • 
AATCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGCAAC 
AsnLeuProGlnGlnLeuPheGlyTyrSerTrpTyrLysGlyGluArgValAspGlyAsn 

310 330 350 

CGTCAAATTGTAGGATATGCAATAGGAACTCAACAAGCTACCCCAGGGCCCGCAAACAGC 
ArgGlnlleV^lGlyTyrAlalleGlyThrGlnGlnAlaThrProGlyProAlaAsnSer 

370 390 410 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACGTCACCCAGAATGAC 
GlyArgGluThrlleTyrProAsnAlaSerLeuLeuIleGlnAsnValThrGlnAsnAsp 

430 450 470 

• • • • • • 

ACAGGATTCTACACCCTACAAGTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGA 

ThrGlyPheTyrThrLeuGlnVallleLysSerAspLeuValAsnGluGluAlaThrGly 
490 510 530 
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(2) 

10 30 50 

CAGCCG7GC7CGAAGCG7TCCTGGAGCCCAAGCTCTCCTCCACAGGTGAAGACAGGGCCA 

70 90 110 

- 

GCAGGAGACACCA7GGGGCACCTCTCAGCCCCACTTCACAGAGTGCG7GTACCCTGC-CAG 
MetGlyKi sLeuSerAla?roLeuHisArgValArgVel?roTrpGln 



130 150 170 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuLeuThrAlaSe rLeuLeuThcPheTrpAsnProProThrThrAlaGlnLeu 



190 210 230 

* 

ACTAC7GA-ATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTJICTTGTCCAC 
ThrThrGluSe rnetProPheAsnValAlaGruGlyLysGluValLeuLeuLeuVelHi s 



250 270 290 

A^7CTGCCCCAGC.\ACTTTTTGGCTACAGCTGGTACA>^GGGGAAJ^GAG7GGATGGCAAC 
30 A s p. L e u ? i* o G 1 r. G 1 n 1 e 'j ? h e G I y T y r S e r7rp7y rLysGlyGluArgVaiAspGiyAsn 



310 330 350 

• • 

35 CG7CA^A77G7AGGA7A7GCAATAGGA^C7CAACAAGCTACCCCAGGGCCCGCA^J%CAGC 
ArcGln:leValGly7yrAlalleGly7hrGlnGlnAla7hr?roGly?rc-AleAsnSe r 



370 3S0 410 

40 

GG7CGAGAGAC^7A7ACCCCAA7GCATCCCTGC7GA7CCAGAuACG7CACCCAGAJ^.7GAC 
GlyArgGlu7hr Ilc7yrProAsnAlaSerLeuLeuIleGlnAsnVal7hrGlr.AsnAsp 



45 



50 



55 
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430 <50 470 

ACAGGATTCTACACCCTACXAGTCA7AAACTCACATCTTGTGAATGAAGAAGCAACTGGA 
ThrGly?heTyrTh:Le*jGlnVallleLysSerAspLeuValAsnGluGluAldThrGly 



CSC 510 530 

» * ■ • 

CAGTTCCATGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 

Gln?heHi svalTy rProGluLeuProLysPr oSc r 1 leSe rSe r AsnAsnSe rAsnPro 



SSC 570 S90 

GTGGAGGACAAGGA7GCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
valGluAspLysAspAlaValAla?he7hrCysGluFroGlu7hrClnAsp7hr7hr7yr 



610 630 650 

20 CTGTGCTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
LeuTrpTrplieAsnAsnGlnSerLeuPtoValSerProArgLeuGlnLeuSerAsnGly 



6/0 690 710 

25 

AACAGGACCC7CAC7C7AC7CAG7G7CACAAGGAA7GACACAGGACCC7A7GAG7G7GAA 
AsnArgThrLsjThrLeuLeuSerValThrArgAsnAspThrGlyProTyrGluCysGlu 



730 750 770 

ATACAGAACCCAGTGAGTCCGAACCGCAG7GACCCAGTCACCTTGAATGTCACC7A7GGC 
IlcGlnAsnProValSerAlaAsnArgSerAsp?roVal7hrLeuAsnVai7hr7yrGly 



35 7S0 810 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACC7CAGC 
ProAspThrProThrl leSe cPcoSe rAspThrTy rTy rArgProGlyAlaAsnLcuSe r 

40 



45 



50 



55 



•71 
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w 



550 870 8S0 

C7CTtC7GC?A7CC. t .GCC7C7AACCCACCTGCACAG7AC7CC7GCC77ATCAJv7GG.-.ACA 
LeuSe r Cy'sTyTAl eAlaSe rArmPro»roAl eGlnTy rSerT r pLeu-I 1 eA-srvGlyT'n r 

5 

SID 930 9S0 

TTCCAGCA.-AGCACA.CAAGAGC7CTTTATCCCTAACATCACTGTGAA7AA.7AGTGGATCC 
PheGlnGlnSe r 7h rGlnGluLeuPhe 1 1 ePr oAsnl 1 eTh rVal AsnAsnSe r Giy Se r 

970 990 1010 

♦ ' m • . „ 

TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
75 Ty rThrCysHi sAiaAsnAsnSe rValThrGlyCysAsnArgThrThrValLysThr I le 

1030 1050- 1070 

• • • . . 

20 ATAGTCACTGAGCT.^AGTCCAGTAGTAGCAAAGCCCCA^J^TCAAAGCCAGCAAGACCACA 
I 1 eVa 1 Th rGiuLeuSerPr oVa 1 Va 1 Al aLy s? r oGlnl le LysAl aSe r Ly sTh r Th r 

1C90 1110 1130 

25 • • ...... . 

GTCACAGGAGATAAGGACTCTGTGAA.CCTGACC7GCTCCACAAATGACAC7GGAATCTCC 
VelThrGlyAspLysAspSe rVal AsnLcuThr CysSe rThrAsnAspThrGly I leSe r 



30 



1150 1170 1190 

• • • . . • 

ATCCGTTGG77CT7CAAAAACCACAGTCTCCCGTCCTCGGAGAGGATGAAGCTGTCCCAG 
IleArg7rpPhe?heLysAsnGlnSerLeuProSe r Se rGluAr gtte tLy s LeuSe rGl n 

35 1210 1230 1250 

• - . 

GGCAACACCACCCTCAGCATAAACCCTGTCAAGAGGGAGGATGCTGGGACGTATTGGTGT 
GlyAsnThrThrLeuSe rllcAsnProValLysArgGluAspAlaGlyThrTy rTrpCys 

40 



45 



50 



55 
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1270 1290 1310 

GACG7C7TCAACCCAA7CAGTAAGAACCAAAGCGACCCCA7CA7GC7GAACG7AAACTA7 
Gl'jVe 1 ?heA5T:?ro! 1 eSe rLysAsnGlnSe rAsp?r ol 1 ene t-euAsnvalAsnTy r 

i33C 1350 1370 

AA7GC7C7AC CACAACAAAA7GGCCTC7CACC7GGGGCCA77GC7GGCA77G7GA77GGA 
AsnAl a Lev; ? r oGi nGluAs nGl yLeuSe r ? r oGl yAl a 1 1 eAlaGl y I leVallleGly 

1390 1410 1430 

GTAGTGGCCCTGG7TGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTGCATT7CGGGAAG 
valvalAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeuHisPheGlyLys 

1450 1470 1490 

ACCGGCAGC7CAGGACCAC7CCAA7GACCCACC7AACAAGA7GAA7GAAG77AC77A77C 
7h rC-lySe r Se rGly? roLeuGln 

1510 1530 1550 

7ACCC7GAAC777GAAGCCCAGCAACCCACACAACCAAC77CAGCC7CCCCA7CCC7AAC 

1570 1590 1610 

A C C C A C AG AAA7AA 777A77C AG AAG7AAAAAAGC A G 7 AA TG AAA C C 7G AAAAAAAAAAA 



1630 
AAAAAAAAAA 
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(3) 



10 30 50 

CAGCCGTGCTCG^GCGTTCCTCGACCCCKACCTCTCCTCCACACGTC^CACAGGGCCA 

10 7 0 90 110 

GC AGCACACACCA7CGGGCACCTC7CAGCCCCACT7CACAGAC7GCG7G7AC CC7GGC AC 
KetGiyKi sLeuSerAlaProLeuHisArgValArgval ? roTrpG) r\ 

15 

130 150 170 

CGGCTTCTCCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTCCCCACCTC 
GlyLeoLeuLeuThrAl aSe c LeuLeuTh r FheTrpAsn? r oP r oTh c Th r A 1 aC 1 nLe u 

20 

190 210 230 

AC7AC7CAA7CCA7CCCA77CAATG77CCACACGGGAAGGAGC77C7TCTCC77G7CCAC 



25 



ThrThrGluSe crte t? r oPheAsnVa 1 Ai aGluGiytysGluVa 1 LeuLeuLeuValH i s 

250 270 290 

AATCTGCCCCAGCA.-XTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGC/^.C 
50 AsnLeu?rcSinCir.Leu?heGlyTy rSe rTrpTy rLysGlyGluArcVa lAspGlyAsr. 

3iC- 330 350 

35 CG7CAAA77C7AGGA7A7CCAA7ACGAAC7CA^C;^GC7AC 

Ar^Glnl 1 evalGiyTyrAlal 1 8Gly7hrGlnGlnAla7hr?roGly?roAl eAsnSe r 

370 390 <10 

40 . . . . . 

GC7CCAGAGACA^7A7ACCCCAAVCCA7CCCTGC7GA7CCAGAACG7CACCCAG;s.A7GAC 
GlyArgGlu7hr Ile7y rProAsnAlaSe rLeuLeu! leGlnAsnval7hcCl.nAsr.As? 



45 *50 $70 

ACAGGA77C TACACCC7ACAA.GTCA7AAAG7CACATC TTGTGAA7GAAGAAGCAACTGGA 
7h rG 1 y Phe 7y r 7h r LeuGl nval 1 1 e Ly s Se r AspLeuVa 1 AsnGl uGl uAl a7h r G 1 y 

50 



55 



( 
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<o 0 510 530 

CACTTCCATCTA7ACCCCGAGCTGCCCAAGCCCTCCA7CTCCAGCA>.CAACTCCA.ACCC7 
GlnPheHi s Vi l?y r ? roGluLeu? r oLys ? r oSe r IleSerSerAsnAsnS'e r Asp.?' r"o" 

550 570 590 

G7GGAGCACAAGGA7GC7G7GCCC77CACC7G7GAACCTGAGAC7CAGGACACAACC7AC 
VaiGluAsoLysAspAlaValAlePhe7hrCysCluProGlu7hrGlnAsp7hr7hr7y r 

610 630 650 

. . » • • 

C7G7CG7GGA?AAACAA7CAGACCC7eCCGG7CAG7CCCAGGC7GCAGC7C7CCAA7GGC 
Leu7rp7rp! ieAsnAsnGlnSe r LeuPcovalSerProAroLeuGlnLeuSe rAsnGly 

670 690 " 710 

AACAGGACCC7CAC7CTAC7CAG7G7CAC/^GCAATGACACAGGACCC7A7GAGTGTGA.s> 
Asr.Af g7hr Le<j7hrLeuLeuSe rVal7hrAr gAsnAsp7hrGly P ro7y rGluCysGlu 



/ s J 



750 770 



A7ACAGAACCCAC7GAC7GCGAACCGCAG7CACCCAG7CACC77GAA7G7CACC7A7GGC 
I leClnAsnProValSerAlaAsnArgSerAspProValThrLeuAsnvalThrTyrGly 



19o 310 B30 

:CGACACCCCCACCA777CCCC77CAGACACC7A77*ACCG7CCAGGGGCAAACC7CAGC 
:sAso7hr ?:o7hr I lrS« r-P-r o-Se r As-pTh-c 7y r 7y r Ar g-P roGi yAl aAsnLs u.Si s. 



S5: 870 690 

rC7CC7CC7A7GCAGCC7C7».CCCACC7GCACAG7AC7CC7GGC77A7CA-=.7GGA.->CA 
;uSe rCvs7vrAlaAlaSe r AsnPro?roAlaGln7y r Se r7rpLeuI 1 eAsnGly7hr 



=10 930 950 

■ • • • 

•7CCAGCAA^CCACACA>.GACCTCTTTATCCCTAACATCACTGTGAATAATAGrCGATCC 
hcGI nG 1 nSe rThrGlnGluLcurhe I IcPcoAsnlleThrVal As nAsnScrGlyScr 

970 990 1010 

7ATAC CTGC C ACGCCAATAACTC AGTC ACTGGCTGC AAC ACGACC AC AGTC AACACG ATC 
TycThcCysHisAlftAsnAsnScrValThcGlyCysAsnAcgThrThrV&lLysThrlie 
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1030 1050 1070 

^TACTCACTCAT^-.TGCTCTACCAC^G/^JVTGGCCTCTCACCTGCCGCCATTGCrCGC 
! iev> IThrAspAsr.AlaLeuProGlnGluAsnGlyLeuSe rProGlyAlal 1 e A 1 a G 1 y 

1090 1110 1130 

ATTGTG ATTGGAGTAGTGGCCCTGGTTGCTC7GATAGC AGTAGC C CTGGC ATGTTTTCTG 
•leva 1 1 1 eGlyVaiValAlaLeuValAlaLeuI leAlaValAlaLeuAl aCysPheLeu 

1150 1170 1190 

CATTTCGGG^AGACCGGCAGCTCAGGACCACTCCAATGACCCACCTAACA/vGATGAATGA 
H i sPheGlyLy s7h rGlySe rSe rClyProLeuGln 

1210 1230 1250 

AGTTACTTATTCTACCCTGAACTTTGAAGCCIIAGCAACCCACACAACCAACTTCAGCCTC 

1270 1290 1310 

CCCATCCCTA^CAGCCACAGA^ATAATTTATTCAGAAGTAAAAAAGCAGTAATGAAACCT 

1330 

G AAA.AAAAAAAAAAAAAAA 



10 
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(4) 

1 acagcacagctgacagccgtactcaggaagcttctcgatcctaggcttatctccacagag $o 

Si gagaacacacaaccagcagagaccatggggcccctctcagcccctccctccacacacctc 120 

He tGlyProLeuSe rAiaPr oProCysTh rKi sLeu 

•21 a tcectigga^ocggQ.tcctgctcacagcatcacttttaaacttctggaa-tccgcccaca. 18 0 
1 1 eTh rTr pLy sG ly va 1 LeuLeuTh rAl aSe r LeuLeuAs nPheTr pAsnPr o? r oTh r 

131 actccccaactceccattgaeccccagccacccaaagtttctgagggcaagcatcttctt 24 0 
ThrAlaGlnVaiThrlleGluAleGlnProProLysValSerGluGlyLysAspValLeu 

2<1 ctacttgtccacaatttgccccagaatcttgctggctacatttggtacaaagggcaaa tg 300 
LeuLeuValKisAsnLeuProGlnAsnLeuAlaGiyTyrlleTrpTyrLysGlyGlntfe t 

75 

301 aca tacgtctaccattaca ttacatcata tg tag tagacggttcaaagaa ttata tatggg 360 
ThrTyrValTyrHisTyrlleThrSerTyrValValAspGlyGlnArgllcIleTyrGly 

351 cctgcatacactcgaagagaaagagtata ttccaatgcatccctgctga tccagaa tg tc 4 20 
ProAlaTy r Se rGlyArgGluArgValTy rSe rAsnAlaSe rLeuLeuI leGlnAsnVal 

20 

.4 21 acgcaggagga tgcagga tec tacacc t taca-catca taaagegacgega tgggac tgga 4 80 
ThrGlnGluAspAlaGlySerTyrThrLcuHisIlelleLysArgArgAspGlyThrGry 

4 61 ggagraactgcacatttcaccttcaccttacacctgcagactcccaagccctccatctcc 54 0 
GlyValThrGlyHisPheThrPheThrLeuHisLeuGluThrProLysProSerlieSer 

25 

d41 agcaccaacttaaatcccacgcaggccatggacgctgtgatcttaacc tgtgatcctecg 6 00 
SerSerAsnLeuAsn?roArcGluAlaMetGluAlaValileLeuThrCysAsp?rcAia 

501 actccagccgcaagctaccagtggtgcatgaatggtcagagcctccctatgactcacacc 6 60 
ThrProAlaAlaSerTyrGlnTrpTrprietAsnGlyGlnSerLeuPrortetThrHisArg 

06I t tgcagctgtccaaaaccaacacgaccctc t ttata t ttgctgtcacaaag tat attgea 720 
LeuGlnLeuSerLysThrAsnArgThrLeuPhellePheGlyValThrLysTyrlleAla 

721 gcaccctatgaatgtgaaatacgcaacccagtgagtgccacccgcagtgacccactcacc 7 80 
GlyProTyrGluCysGlullcArgAsnProValSerAlaSerArgSerAspProValThr 

35 

7S1 ctgaatctcctcccaaagctgtccaacccctacatcacaatcaacaac ttaaaccccaca 84 0 
LeuAsnLeuLeuProLysLeuSerLysProTyrlleThrlleAsnAsnLeuAsnPrcArg 

841 gacaataaggatgtcttaaccttcacctgtgaacctaagagtgagaactacacctacatt 900 
GluAsnLysAspValLeuThrPheThrCysGluProLysSerGluAsnTyrThrTyrl le 

901 tggtggctaaatggtcagagcctccctgtcagtcccagggtaaagcgaccca ttgaaaac 960 
TrpTrpLeuAsnGlyClnSe r LeuProValSer FrcArgValLysAcgProI leGlvAsn 

961 ogga tec teat tctacccaa t g tcacgagaaa t gaaacaggacc t ta t c aa tg t ga a a t a 10 20 
ArglleLeuI 1 e Leu? r OAs nva lTh r Ar gAsnGluTh r GlyP r oTy r Gl nCy sG 1 u 1 1 e 

45 1021 cgggaccgatatggtggcatccgcagtgacccagtcaccctgaatgtcctctatcgtcca 106 0 
ArgAspArgTy rGlyGlylleArgSerAspProValThrLeuAsnValLeuTy rGlyPro 



50 



55 



7Q 
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1081 


gacctccccagcatttacccttcattcacctattaccgttcaggagaaaacctctacttt 
AspLeu?roSerIleTyrProSerPheThrTyr7yrAcgSerGlyGluAsnLeuTyr?he 


1K0 


1141 


tcctgcttcgctgagtctaacccacgggcacaatattcttggacaattaatgggaagttt 
SerCys?heGlyGluSerAsn?roArgAlaGlnTyrSerTrpThrlleAsnGlyLys?he 


12C0 


1201 


cacctetcacgacaaaagctctctatcccccaaataactacaaagcatagtgggctctat 
GlnLeuSerGlyGlnLysLeuSerlleProGlnlleThrThrLysHisSerGlyLeuTyr 


1 2 5 2 


1261 


octtgctctctccctaactcagccactggcaaggaaagctccaaa tccatcacactcaea 
AlaCysScrValArgAsnSerAlaThrGlyLysGluSerSerLysSerlleThrValLys 


1320 


1321 


gtctctgactggatattaccctgaattctactagttcctccaattccattttctcccatq 
valSerAspTrpIleLeuProEnd 


1350 



^aatcacgaagagcaagacccactctgttccagaagccctataatctggaggtggacaac 144 0 

14 41 tcgatgtaaatttcatgggaaaacccttgtacctgacatgtgagccactcagaactcacc 1500 

loOl aaaatgttcgacaccataacaacagctactcaaactgtaaaccaggataagaagttgatg 1560 

illl IttSSS^ "20 
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1680 
1740 
1800 
I860 
1920 
1980 
2010 
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10 



15 



20 



25 



30 



35 



40 



(5) 



qqqtgqatcctaggctcatctccataggggagaacacacatacagcagagaccatggga 59 
" ' — ' MetGly 



50 cccctctcaccccctccctgcactcagcacatcacctggaaggggctcctgctcacagca 
ProLeuSerAia?ro?roCysThrGlnHisIleThrTrpLysGlyLeuLeuLeuThrAla 



i i o 



45 



120 tcacttttaaacttctggaacctgcccaccactgcccaagtaataattgaagcccagcca 17 S 
SerLeuLeuAsnPheTrpAsnLeuProThrThrAlaGlnValllelleGluAlaGlnPro 

180 cccaaagtttctgaggggaaggatgttcttctacttgtccacaatttgccccagaatctt 239 
ProLysValSerGluGlyLysAspValLeuLeuLeuValHisAsnLeuProGlnAsnLeu 

24 0 actggctacatctggtacaaagggcaaatgacggacctctaccattacattacatcatat 299 
ThrGlyTyrlleTrpTyrLysGlyGlnMetThrAspLeuTy.rHisTyrlleThrSerTyr 

3o0 gtagtagacggtcaaattatatatgggcctgcctacagtggacgagaaacagtatattcc 359 
Valval AspGlyGlnlle I leTy rGlyProAlaTyrSe rGlyArgGluthrValTyrSer 

36 0 aatgcatccctgctgatccagaatgtcacacaggaggatgcaggatcctacaccttacac 419 
AsnAlaSerLeuLeuIleGlnAsnValThrGlnGluAspAlaGlySerTyrThrLeuHis 

4 20 atcataaagcgaggcgatgggactggaggagtfaactggatatttcactgtcaccttatac 479 
IlelleLysArcGlyAspGlyThrGlyGlyValThrGlyTyrPheThrValThrLeuTyr 

4 80 tcggagactcccaagcgctccatctccagcagcaacttaaaccccagggaggtcatggag 53 9 
SerGluThrPrcLysArgSerlleSerSerSerAsnLeuAsnProArgGluValHetGlu 

54 0 gctctgcgcttaatctgtgatcctgagactccggatgcaagctacctgtggttgctgaat 5S9 
AlaValArgLeuIleCysAspProGluThrProAspAlaSerTyrLeuTrpLeuLeuAsn 

6 00 ggtcagaacctccctatgactcacaggttgcagctgtccaaaaccaacaggaccctctat 6 59 

GlyGlnAsnLeuProHetThrKisArgLeuGlnLeuSerLysThrAsnArgThrLeuTyr 

660 ctatttggtgtcacaaagtatattgcagggccctatgaatgtgaaatacggaggggagtg 719 
LeuPheGlyValThrLysTyrllcAlaGlyProTyrGluCysGluIleArgArgGlyVal 

7 20 agtgccagccgcagtgacccagtcaccctgaatctcctcccgaagctgcccatgccttac 779 

SerAlaSerArcSerAspProValThrLeuAsnLeuLeuProLysLeuProKetProTyr 

7 80 atcaccatcaacaacttaaaccccagggagaagaaggatgtgttagccttcacctgtgaa 8 39 
IleThrlleAsnAsnLeuAsnProArgGluLysLysAspValLcuAlaPheThrCysGlu 

840 cctaagagtcggaactacacctacatttggtggctaaatggtcagagcctcccggtcagt 899 
ProLysSerArgAsnTyrThrTyrlleTrpTrpLeuAsnGlyGlnSe rLeuProValSer 

900 ccgagggtaaagcgacccattgaaaacaggatactcattctacccagtgtcacgagaaat 959 
ProArgvalLysArgProIleGluAsnArgI leLeuI leLeuProSe rValThr Ar gAsn 

960 gaaacaggaccctatcadtgtgaadtacgggaccgatatggtggcatccgcagtaaccca 1019 
GluThrGlyProTyrGlnCysGluIleArgAspArgTyrGlyGlylleArgSerAsnPro 
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1020 g tcaccctgaatctcctctatggtccagacc tccccagaatttacccttact tcaccta t 1079 
valThrLeuAsnValLeuTy rGlyProAspLeuProAr gl leTyrProTy rPheThrTy r 

1080 taccgttcagcaaaaaacctcgacttgtcctgctttgcggactctaacccaccggcagag 1139 
5 TyrArgSerGlyGluAsnLeuAspLeuSerCysPheAlaAspSerAsnProProAl aGlu 

114 0 tatttttggacaettaatgggaagtttcagctatcaggacaaaagctctttatcccccaa 1199 
7y rPheTr pThr 1 1 eAsnGly LysPheGlnLeuSe rGlyGlnLysLeuPhel leProGln 



70 



75 



1200 attactacaaatcatagcgggctctatgcttgctctgttcgtaactcagccactggcaag 1259 
IleThrThrAsr.KisSe TGlyLeuTyrAlaCysSerValArgAsnSerAlaThrGlyLys 

1260 gaaatctccaaatccatgatagtcaaagtctctggtccctgccatggaaaccagacagag 1319 
GluIleSerLysSerMetlleValLysValSerGlyProCysHisGlyAsnGlnThrGlu 

1320 tctcattaatggctgccacaatagagacactgagaaaaagaacaggttgataccttcatg 1379 
SerHisEnd 

1380 aaattcaagacaaagaagaaaaaggctcaatgttattggactaaataatcaaaaggataa 14 39 
14 40 tgttttcataatttttattggaaaatgtgctgattcttggaatgttttVttctccagatt .1499 
1500 tatgaactttttttcttcagcaattggtaaagtatacttttgtaaacaaaaattgaaaca 1559 
1560 tttgcttttgctctctatctgagtgccccccc 1591 

20 

2. Replizierbares rekombinantes Kloniervehikel mit einem eine Nucleinsaure nach Anspruch 1 umfassen- 
den Insert. 

25 3. Zelle, die mit einem rekombinanten Kloniervehikel nach Anspruch 2 transfiziert, infiziert Oder injiziert ist. 

4. Verfahren zur Herstellung eines Polypeptids, umfassend die Schritte 

(a) des Kultivierens der Zelle nach Anspruch 3, 

(b) des Gewinnens des durch diese Zelle exprimierten Polypeptids. 

30 

5. Verfahren zur Herstellung eines gegen ein Polypeptid gerichteten Antikorpers, umfassend die Schritte 

(a) des Herstellens des Polypeptids durch das Verfahren des Anspruchs 4, 

(b) des Injizierens des Polypeptids in einen Wirt, der zur Bildung von Antikorpern befahigt ist, und 

(c) des Gewinnens der Antikorper. 



35 



Revendications 



1. Acide nucleique comprenant une sequence de bases qui code pour une sequence peptidique, 
caracterise en ce que le groupe d'acides nucleiques est de TADN choisi parmi le groupe de cinq 
40 sequences ci-apres : 
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:c 30 so 

CAGCCGTCCTCCAACCCTTCCTGGAGCCCAAGCTCTCCTCCACAGCTGAAGACACGCCCA 

5 

70 90 110 

GCACGAGACACCA7GGGGGACG7C7CAGCCCCACTTCACAGAGTGCG7G7AGCC7GGCAG 
10 *necGlyHisLeuSerAla?coLeuHisArgValAcgvalPro7rpGln 

120 150 170 

CGGC77C7GC7CACACCC7CAC77C7AACCr7C7GGAACCCCCCCACCAC7GCCCAGC" C 
75 G 1 v L e - - 2 - LeuT'r. : A 1 a Se r Le'j LeuTh : ? e T : p A s n ? c c ? c c T h : T h r A I c . G : r.L» - 

I9C 210 230 

20 A C T A C 7 G A A 7 C C A 7 G C C A 7 T C AA T G T "I 1 G C A G A G G G G A-A G G A G G 7 7 C 7 7 C 7 C C 7 7 C 7 C C A I 
Th c7h:Glu5 2 r^.e c ? r o?he As r.Ve 1 Al aGluGl yLy sG luVa! Leu Leu te u^a 1 H i s 
2S0 270 290 

AATC7GCCCCAGCA-AC77777GGC7ACACC7GG7ACAA^GGGGA^AGAG7GGA7GC-CAAC 
25 A s n Le u ? r cG 1 .~iG 1 n L e u ? h e G 1 y Ty r S e r 7 r c7y rLysGlyGl u A c g Va I A s pG I y A s r. 



Jin 



330 350 



30 CG7CA^77G7AGGA7A7CCA.-.TAGGAJvCTC;^-XA-AGC7ACCCCACCGCCCGC 

ArcGlnl le /a IGlyTyr Ala I leClyThrClnClnAlaThr ?:cGly?r oAlaAsr.Se r 

370 390 410 

35 . . 

GG7CGACAGACAA7ATACCCCAATGCATCCCTCCTGATGCAGAACGTGACCCACAA TGAC 
Gl yArgGlu7hr I 1 e7y r P coAsnAl aSe r Leu Leu I IcGLnAsnve iTh r Gl nAsnAsp 

^ 00 «S0 010 

ACAGGATTCTACACCCTACAAGTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGA 
ThcGlyPhcTy c ThcLeuGlnVal 1 1 e Ly sSe r As pLe uVa 1 AsnGl uG 1 uA 1 aTh r G 1 y 

45 



50 
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510 s 3 0 

CAGTTCCATG7ATACCCGGACCTCCCC^vCCCCTCCATC7CCAGC/V\C^ACTCC.\.nCCCT 
Gl r.PheH: sv» LTv : ? :DGluLeu?coLvs?roSe c 1 1 e Se r Se r AsnAsnSe r Asn? ro 



570 



10 



GTGGAGCAC.---.GCA7GC7GTGCCCT*:CACCTGTG.^ACCTGACACTCAGGACACAACC:aC 
va ICluAsp-y sAspAlaVaiAlsP.^eThrCysGluProGluThrGlnAspTh r7h rTy r 



SIC 630 650 

75 C7G7GG7CGA7A.VACA.A7CAGAGCC7CCCGGTCAGTCCCAGGCTGCAGC7G7CC;^.TGGC 
L e u 7 r p7 : z 1 1 e A s - A s nG 1 n S e c Le u ? r oVa 1 S e r P r OA r 9 L e u G 1 n Le u 5 c c A £ nG 1 y 

57: 690 710 

20 AACAGCACCC7CAC7C7ACTCAG7C7CACAACGAATGACACACGACCCT7VTGAG7CTCAA 
A 5 n A r 5 T . s . : > e -J T : L e u L e 'j S e : v 3 IT I u A r q A : n A s pTh cCly? roTyrCl uCy sClu 



73: 750 770 

25 ...... 

A7ACAGAJ^CCCAG7CAG7GCGAACCGCAGTGACCCAG7CACCT7G^TGTCACC7A7CGC 
1 1 e G 1 n A s r. ? r c v * i s 2 r A 1 a A s n A c c S e cAs??roVai7hr LeuAsnv&lThr Ty rGI y 



79: 610 830 

30 

CCGGACACC::CACCA7T7CCCC77CAGACACC7AT7ACCGTCCAGGGGCAAACC7CAGC 
PrcAs?7hr? :o7^r : leSe c? :oSc cAspThi*7y rTy rAcg ? roG.lyAl aAsnLeuSs r 



85: B70 SS0 

CTCTCC7GC7A7GCAGCCTC7A^CCCACCTGCACACTAC7CC7GCC7TATCAArGGA«ACA 
LcuSe rCys7y r AlaAlaSe rAsnProProAlaClnTy cSe rTrpLeoI leAsnGlyThr 

913 930 950 

T7CCAGC.--^0CACACAACACC7C^ 

PheGlnGlnSe c 7h rGlnCluLeuPhe I leProAsnl ieTh c vai AsnAsnSe :Gly 5e r 



970 990 .1010 

TATACCTGCCACCCCAATAACTCACTCACT'JGCTCCAACACCACCACAGTC AAGACGATC 
TycThrCysHisAlaAsnAsnSe c Va lTh rGI yCy sasha c gTh r Th r va 1 Ly s7M 1 I 1 e 
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10 3 0 10 50 10 7 0 

A7AC7C AC7C A7AA7G CTC TACCAC AJkCAJ>AATCGCC 7C7C AC C 7CGCCCC ATTGC "i CC C 
5 I 1 eva 17^ r AspAsnAl a Leu? roGlnGluAsr.GlyLeuSe c ? coGl yAi a I 1 eAlaG 1 

mo 1130 

A77G7GA77GGAG 7AG7GGCCCTCG77GCTCTCA7AGG AG TAG C C C7GGCATG 7777%- tc 
70 I leva 1 I 1 sGlwalValAiaLeuVaiAlaLeuI 1 eAl aVa 1 Ai a LeuAl aCy sPhe.e*. 



115: 1170 11S0 

15 CATTTCGGG.-^GACCGCCAGGCCA.\GCCACCAGCG7GATCTCACAGAGCACAAACCCTCA 
Hi sPheGlyLy sThrGlyAcgAl aSe c AspClnArgAspLeu7hrGiuHi sLys? roSe r 

1210 1230 1250 

20 

G7C7CCa^CCACAC7CACCACCAC7CCAA7CACCCACC7AACAAGA7C*A7CAAGTTACT 
va 1 Se c AsnH i sThrGlriAspHi sSe cAsnAsp? coP roAsnLy srte tAsnGluVa ITh : 



25 



12 7 3 1290 1310 

TA7TCTACC C7G AACTTTGAAGCC CAGCAA.CCC ACACA-ACC AACTTC AGCCTCC CC ATCC 
TvrSe rThrLeuAsr.PheGluAiaGlnGlnProThrClnPcoThrSe rAlaSerProSer 



30 1 2 5 : 1 3 50 1 37 0 

C 7 AA C A G C C A. C A G AAA T AA TTTATTCAG AA G T AAAAAA G C A G T AA T G AAA CCTG7CCTGC 
LeuThrAlaThrGluI Is I leTy cSe cGiuValLysLy sGln 

1390 1<10 1<30 

7CAC7GCAG7GC7GA7GTA77TCAAG7C7C7CACCC7CATCAC7AGGAGA77CC777CCC 

1450 1470 1490 

C7G7AGGG7AGAGGGG7GCGGACAGAAACAAC77TCTCCTAC7C77CC77CC7AA7AGGC 

4 * 1 5 1 0 1 530 15S0 

ATC7CCAXCC 7GC CTGGTC AC7GC CC CTCTC7C AGTGTC AATA-G ATGAAAGTAC ATTGGG 



40 



50 



1590 1610 
AG7CTGTAGG/nAACCCAJKCCT7CT7C7CATTGAAATTTGGCAAAGC7CAC7TTGGGAAvAC 
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153: 1650 1670 

AGOG AC C AG AA-C 77C C C C 7 C C C TTC C C C TTTTCC C AA C C 7GG AC 7TC7777 AAAC TT 3 C C 

5 

15?: . . 1710 1720 

TGTTCAGAGCAC7CATTCCTTCCCACCCCCACTCCTG7CCTATCAC7CT.\ATTCGGATTT 

10 

1750 1770 1790 

GCCAiAGCC77GAGG77A7G7CC7777CCAT7^G7ACA7G7CCCAGGAAACAGCGAGAG 

75 

1810 1830 1850 

A G A G AAAG 7 AAA CG G C AG 7 AA TG C 7TCT C CTATTTC TC C AAAG C C 7TG TG 7G AA C T A C C A 

20 

1370 1890 1910 

AAGAGAAGAAAA7CAAA7ATA7AACCAyATACTCAAA7GCCACAGG7T7G7CCAC7G7C AG 

25 

1930 1950 1970 

GC77G7C7ACC7G7AGGA7CACGG7C7A.\GCACCT7GGTGCTTAGC7AGAA7ACCACC7A 

19S0 2010 2030 

A7CC77C7GCC>^GCCTG7C77CAGAGAACCCACTAGAAGCAACTAGGAAAAA7CAC77G 

35 

2050 2070 2090 

CCAAAA7CCAAGCCAATTCC7GATGGAAAATGCAAAAGCACATATA 7G77TTAA7A7CT7 

40 21 10 2130 2150 

7A7GGGC7C7G77CAAGGCAC7GCTGAGAGCCAGGGGTTATAGC7TCAGGAGGGAACC/-.G 

45 2 1 70 2190 2210 

CTTCTCATAAA<^AC AATCTGCTAGGA^CTTGGGAAAGGAATC AGAGAGCTGCCCTTCAGC 
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Hi: 2 2 50 2270 

CA T7A777 AAA77G77AAAGAA7AC AC AA777GGGG7A77GGGA77777C : CC r ; * iv- 

5 

2 3io 22.20 

7 G A G A C A 7 7 1 C A C C A 7 7 7 7 AA 7 777 7G 7 AA C 7G C T7 A 777 A 7C 7 GAAAAG-C GT 7 AT 77 7 T 

10 

22 5: 2270 2390 

AC77AGC77AGC7A7C7CAGCCAA7CCGAT7CCC77AGG7CAAAGAAACCACCGAAA7CC 

75 

2^10 2430 \ 2450 

CTCAGG7CCC77GG7CAGGAGCC7C1XAAGA7T7TTTT7GTCAGAGCCTCCAAATAGAAA 

20 

2 4 7 0 2 490 2510* 

A7AAGAAAACG7777C77CA77CA7GGCTAGACCTAGATTTAACTCAG777CTAGGCACC 

2S30 2550 2570 

7CAGACCAA7CA7CAAC7ACCAT7CTATTCCATCTTTCCACC7GTGCATT7TCTGTT7CC 

30 2590 2610 2630 

CCCCA77CAC777G7CAGGAAACCTTGGCCTC7GCTAAGGTGTATTTGGTCCT7GAGAAG 

35 2650 2670 259C 

7GGGAGCACCC7ACAGGGACAC7ATCACTCATGCTGGTGGCATTGTTTACAGCTAGAtAG 

40 2710 2730 2750 

C7GCACrGG7GC7AA7GCCCC7TGGGA/vA7CGCCCTCTGAGGAGGAGGATTA7AAC7TAG 

45 2770 2790 20 10 

G C C 7 A C C C 7 C 7777 AA C A G C C 7 C TC AAA 777 A TC T7TTC TTC 7 A7G C G GT C 7 A T AA A 7 C 7 



50 



2830 2050 2370 

* . . . 

A TC T TA TAA rA/^AAGGAACCACAGGAGGAAGACAGGCAAATGTACTTC7CACCCAG TC7 
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2 553 2910 2930 
?C7ACACACA?CCAA7C7C777CGGGC7AAGAGAAACG7777A7TC7ATA77Gr7TACC7 

5 

2950 2970 2990 

GATCTwATG?TAGGCCT.^GACCC'"TTCTCCAGGAGGATTAGGTTGGAGTTCTCTATACT 

10 

3 C 1C 3030 3050 
CAGG7ACC7CTT7CAGCGTTT7CTAACCCTGACACGGACTGTGCATACTTTCCCTCATCC 

75 

IZ'Z 3090 3110 

A7GC7G7GC7C7GTTATTTAATTTTTCCTGGCTAAGATCATGTCTGAATTATGTA7GAAA 

20 

3130 3150 3170 

ATTATTCTATGTTTTTATAATAAAAATAATATATCACACATCCAAA/vA-fiwAAAA, 

25 



30 



35 



40 
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i: 30 50 

5 

CACCCCTGCTTGAAGCGTTCCTGCACCCCAACCTCTCCTCCACAGGTGAAGACAGCGCC 1 



- so no 

70 . . . . . 

GCAGGAGACA-CA7GGGGCACCTC7CAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
*e cGiyKi sLeuSe rAlaProLeuHi sAcgValArgVal?ro7rpGin 

is 132 150 170 

GGGC77C7GC7CACAGCC7CAC7TCTAACCTTCTGGAACCCGCCCACCAC7GCCCAGC7C 
GlyLeuLe-jLauThrAlaSerLeuLeuThcPheTrpAsnPcoProThrThrAlaGlnLeu 

20 

190 210 i30 

* " . „ * 

AC7AC7GAA7CCA7GCCA77CAA7G77GCAGAGGGGAAGGAGG77C77C7CC77C7CCAC 
7hfThcGiui2rnet?:oPheAsnValAlaGluGlyLysGluValLeuLeuLeuValHis 



25 



30 



250 270 290 

AA7C7GCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATCGC^AC 
Asr.Leu?:cC:r.Gl.nLej?heGly7yrSerTrp7yrLysGlyGluArgVaiAspGlyAsn 

3i= 330 350 

CGTCA^TTGTAGGATATGCAATAGGA^CTCAACAACCTACCCCAGGGCCCGCAAACAGC 
AfcGlnlleVaiGly7yrAlaIleGlyThrGlnClnAlaThr?roGly?roAl a AsnSer 



35 



370 390 410 

40 GG7CGAGAGACAA7ATACCCCAATGCATCCCTGCTGATCCAGAACGTCACCCAGAATGAC 
GlyArgGluThrllcTyrProAsnAlaSerLeuLeuIlcGlnAsnValThrGlr.AsnAsp 



45 



50 
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450 470 



70 



ACAGCATTCTACACCC7AC.^CTCA7Ay^C7CACATCTTG7GAA7CAACAAGCAACTGGA. . 
TfcrGi v?he7y :7hr -euGlr.Vai I ieLysSe rAspLeuVai AsnGluGiuAi a7h rGly 

<9C- 510 5 30 

« • • 

CAG77CCA7GTA7ACCCGGAGC7GCCCAAGCCC7CCA7C7CCAGCAACAAC7CCAACCCT 
Glri?heHisVal7yr?:cGluLeuPcoLys?roSerIleSecSecAsnAsnSerAsn?ro 

550 570 590 

G7GGAGGACAAGGA7CC7G7GGCC77CACC7G7GAACC7CAGAC7CAGGACACAACC7AC 
75 ValGIuAs?LysAs?AlavaiAla?he7hrCysClu?coGlu7hrGlnAso7hr7hr7y r 

610 630 650 

20 C7G7GG7GGA7AAACAA7CAGAGCC7CCCGG7CAG7CCCAGGC7GCAGC7G7CCAA7GGC 
Leu7r?7r?I ieAsr.AsnGlnSe rLeuPcoValSe cProArgLeuGlnLeuSe rAsnGly 

670 690 710 

25 • • • • » • 

AACAGGACCC7CAC7C7AC7CAG7G7CACAAGGAA7GACACAGGACCC7A7GAG7G7GAA 
AsnAc97hr L»j7hr LeuLeuSe rVal7hrArgAsnAspThrGly?ro7y cGluCy sGlu 



30 



730 750 770 

A7ACAGAACCCAGTGAG7GCGAACCGCAG7GACCCAGTCACCTTGAATG7CACC7A7GGC 
IleGlnAsnPcoValSerAlaAsnArgSecAsp?roValThrLeuAsnVaiThc7yrGly 

35 

•790 810 830 

• • • * • * 

CCGGACACCCCCACCA777CCCC77CAGACACC7A77ACCG7CCAGGGGCAAACC7CAGC 
ProAsp7hr ?ro7hr I lcSe r PcoSe rAsp7hr 7y r7y cArgP coGlyAl aAsnLcuSe r 

40 



45 



50 
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( 



5 



70 



20 



25 



35 



40 



45 



50 



S SO 670 890 

C7C7C C7CC7A7GCAGCC7C7AACCCACC7GCACAG7AC7CC7GCC77A7CAA7GGAACA 
LeuSe rCvsTv? :Ai *Aia52 rArs-n?ro?roAieGlnTy cSerTrpLeu! 1 eA-s-rvGl y?h r 



?i: 930 950 

TTCCAGCA.^GCAC ACAAGAGC7C7T7A7CCC7AACA7CAC7G7GAA7AA7AG7GGA7CC 
PheGlnClr.Se :7h cCl nCluLeuPhe 1 1 eProAsnl leThr va 1 AsnAsnSe rGl y Se r 



970 990 1010 

. • • • • 

75 7A7ACC7GCCACGCCAA7AAC7CAG7CACTGGCTGCAACACGACCACAGTCAAGACGATC 
Ty r 7h rCysHisAlaAsnAsnSe r va 1 7h rGl yCy sAsnAr gTh r7hr Val Ly s7h rile 



1030 1050 1070 

A7AG7CAC7GAGC7AAG7CCAG7AG7ACCAAAGCCCCAAATCAAAGCCAGCAAGACCACA 
IleVal7hrGiuLeuSer? roVa 1 Va lAlaLysProGlnlleLysAlaSer Ly sTh r7h r 

1 C 9 C 1110 1130 

G7CACAGGAGA7AAGGAC7C7G7GAACCTGACC7GC7CCACAAA7GACACTGGAA7C7CC 
Val7hrGiyAs?LysAspSe rValAsnLeuThrCysSe cThrAsnAsp7hrGlyI leSe r 

30 1150 1170 1190 

■ « • • • • 

A7CCG77GC77CT7CAAAAACCAGAGTCTCCCGTCCTCGGAGAGGATGAAGCTGTCCCAG 
IleArgTroPhePheLysAsnClnSerLeuProSerSerGluArglletLysLeuSerGln 



1210 1230 1250 

GCCAACACCACCC7CAGCATAAACCC7GTCAAGACGGAGGA7CC7GGGACCTA77GGTC7 
ClyAsn7hr7hrLeuSe r!leAshProValLysArgGluAspAlaGiy7hrTy r7rpCys 
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1290 i3io 

CAC37C77CAACCCAA?CAG7A^GAACCAAAGCCACCCCA7CA7GCTGAACG-AAAC7A7 
5 CiuVel PheAsr.? :ol leSe :LysAsnGlnSerAspProI l erte tLeuAsr.valAsnTy r 

1350 1370 

J0 AA.TGCTCTACCACAAGAAAA7GGCCTCTCACCTGGGGCCATTGCTGGCAT7GTGATTGGA 
Asr.AlaLe-rrsGinGlaAsnGlyLeuSecProGlyAlalleAlaGlylleVallleGly 

13S0 1410 1430 

75 

G7AGTGGCCCTGG7TGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTGCATTTCGGGAAG 
ValValAiaLeuVaiAlaLeuIleAlaValAlaLeuAlaCysPheLeuHisPheClyLys 

2Q 1450 1470 1490 

ACCGGCAGC7CAGGACCACTCCAATCACCCACCTAACAAGATGAATGAAC7TACTTATTC 
ThrGlySc rSe :Gly? roLeuGln 

25 1510 1530 1550 

TACCCTGAACTTTG^GCCCAGCAACCCACACAACC^CTTCAGCCTCCCCATCCCTAAC 

*> 1570 1590 1610 

AGCCACAGAAATAATTTATTCAGAAGTAAAAAAGCAGTAAT^ 

35 1630 
AAAAAAAAAA 

40 
45 
50 



55 



( EP 0 346 710 B1 



(3) 



::• 30 so 

C AG CC GTG CT Z Z C- C G 7 7 C C TGG AGC C C AAGC7C7C CTCC ACAGC7C AAG AC AGGGC C A 

w 

: : 90 no 

GC AGG AG AC AC C AT G GCGC AC CTCTCAGCCCCACTTC AC AG AGTGCGTGTACCCTGGC AG 
r»s ;G 1 vKi s LeuSe r Al a?roLeuK i sAr ova 1 Argva 1 ? r o7r pel n 



2 30 ISO 170 

CCGC77C7GC7CACACCC7CAC77C7AACC77C7GCAACCCGCCCACCAC7CCCCAGC7C 
20 GlyLeuLeuLeuThrAlaSe cLeuLeuTh i-Phe7rpAsn?roPco7hc7hrAlaGinLeu 

190 210 2 30 

25 AC7AC7GAATCCA7CCCA77CAATG77GCAGAGGGCAAGCAGG77C77C7CC77G7CCAC 
7hr7hrGiuSe r Me t ? r o?heAsnVa 1 Ai aGl uGl y Ly sG luVa 1 Leu Leu Le uVa i K i s 



250 270 290 

30 ...... 

AA7C7GCCCCAGCAAC77777GGC7ACAGCTGG7ACAAAGCGCAAAGAC7GGA7GGCAAC 
AsnLeu? rcSlr.GlnleurheSiyTy :Se r7rp7y r LysGlyCluArgVaiAspGlyAsr. 



Si: 330 350 

CG7C.^A77G7AGGA7A7CCA.a7AGGAAC7CAACAAGC7ACCCCAGGGCCCGCAAACAGC 
ArsGlnl i evaiciy7y rAial leGiy7hrGInGlnAla7hr?c oGly ? roAl aAsnSe c 



370 390 410 

GG7CCAGACACA < -A7A7ACCCCAA7GCA7CCCTCC7GA7CCACAACC7CACCCAC/VA7GAC 
GlyAcgGlu7hr I le7y rProAs nA la Sc r LeuLeuIleClnAsnval7hcGlnAsnAs? 



430 450 470 

ACAGGAT7C 7ACACCC7ACAAC7CA7AAAG7CAGA7CT7G rCAA7CAACAAGCAACTCG A 
7hrGlyPhe7y cThr LeuGlnval I leLy sSe r As pLeu va 1 AsnCl uGl uA 1 a7h r G 1 y 
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4 ? :• S 1 0 5 3 0 

CACTTCCATCTA?ACCCGCACCTCCCCAACCCCTCCATCTCCAGCAAC^CTCCAACCC7 
GlnPheHisVciTyrProCluLeuProLysPcoSerlleSecSecAsriAsnSerAsp.? r*o 

5 ' ■ 

550 570 590 

GTGGAGGACA.^GGATGC7GTGGCCTTCACCTGTGAACCTGAGACTCAGGACAC/\ACCTAC 
70 vaiGluAs?LysAspAlaVdiAlaPheThrCysCluProGluThrGlnAspThr?h:Ty r 

610 630 650 

75 CTGTGGTCGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGCCTGCAGCTGTCCA^TGCC 
LeuTrpTrpIl eAsnAsnGlnSe r LeuProValSe rProArgLeuGlnLeuSe rAsnCl y 

670 690 710 

20 - 

AACAGCACCCTCACTCTACTCAGTCTCAC/vAGCAATGACACACGACCCTATGAGTGTG.^ 
Asr.ArgTh : LsuTh cLeuLeuSe r va lTh rArgAsnAspTh rGlyP roTy rGluCy sGlu 

730 750 770 

25 

ATACAG.\ACCCAGTGAGTGCGAACCGCAGTCACCCAG7CACCTTG>lATGTCACCTATGGC 
IleGlnAsnProValSerAl aAsnA cgSe rAspPcoValThr LeuAsnValTh rTy r Gl y 

30 

■? = :■ a l o 330 



35 



C Z Z C A J A C Z Z Z Z A C C A T T T C C C C T T C A G A C AC C T A TT AC C GTCCAGG^oC AAA C C T C A G C 
? :cAs?7>. : ? rcT'r. : 1 1 ?<e r?:c-Se tWspThrTy cTyrAcc? coGlyAiaAsnLeuSfj 



35: 970 e90 

ctctcctc::a7gcagcctct^.cccacctgcacagtactcctggcttatcaatcg.^aca 

40 LeuSerCvsTviAiaAldSerAsnProProAlaGlnTvrSerTroLeuIleAsnGlvThr 



910 930 950 

45 TTCCAGC AA^GC ACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTGGATCC 
PhcGlnGlr.Sc f TMrGlnCluLcuPhe 1 1 e ? r OAs n 1 1 cTh cValAsnAsnSe rGlySc c 

970 990 1010 

50 

TATACC7CC C ACGCC A AT AACTCAGTC ACTCGCTCC AAC ACG ACC AC AGTC A.AGACGATC 
Ty cT?> rCy sH I sa! aAsnAsnSe cVa LTh cGlyCysAsnAc gTh cTh cValLysTh c I 1 e 
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1030 1050 1070 

ATAC7CAC7SATAA7CCTC?ACCACAACAAAATCCCCTCTCACC7CCCCCCAT7CC7CGC 
s : ievslThrAspA.s.-Al4Leu?coClaGluAsr.ClyLeuSe c?coClyAlaIieAlaGiy 

1090 1110 1130 

io ATTC7GATTCGACTAC7CGCCC7GGT7GC7C7GATAGCA.G7AG,CCC7GGCATC7TTTCTC 
I leva li leClyvalvalAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPhsLeu 

1 i 5 G 1170 1190 

15 ■ ■ 

CA7T7CGGGAAGACCGGCAGC7CAGGACCACTCCAATGACCCACCTAACAAGATGAATGA 

His?heGlyLys7hrGlySecSecGlyProLeuGln 

20 1 210 1230 1 250. 

AGTTAC7TATTC7ACCC7GAACTTTGAAGCCCAGCAACCCACACAACCAACT7CAGCCTC 

25 1270 1290 1 310 

CCCATCCCTAJStCACCCACACAAATAATTTATTCACAAGTAAAAAACCACTAATCAAACCT 



1330 

30 

G AAAAAAAAAAAAAAAAAA 
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: acagcacagcrgacagccgtactcaggaagcttccggatcctaggcttatctccacacag 60 

5 

5 1 aacaacacacaagcaccagagacca tgggccccctc tcagcccctccc tccacacacctc 120 

MccCiy ? roi-suSe c Ai a ?r o Pc oCysTh r H i sLeu 

121 atcactccgaaccc^c.ccctgcccacagcatcaccLttaaacttctggaatccgcccaca I S O- 
Il eThrTcpLysG iyVai Leu LeuTh rAiaSe rLeuLeuAsnPheTrpAsnP ro? roTh : 

i a i actccccaaCwCacgattgaaccccagccacccaaagtttctgagggcaagcatctcctt 2 c 0 
ThrAlaGlnValThr IleGluAlaGlnProProLysValSerCluGlyLysAspValLeu 

241 ctacttgtccacaa tttgccccagaatct tgctggctaca tttggtacaaagggcaaa tc 3 00 
LeuLeuValKisAsnLeu? roGinAsnLeuAlaGiyTy rlleTrpTyr LysGlyGlnKe t 

J^i acatacgtctaccattaca ttaca tea tatgtagtagacggtcaaagaa ttata ta tggg 360 
ThrTyrValTycHisTyrlleThrSerTyrValValAspGlyGlnArgllelleTyrGly 

361 cctgcatacagtcgaagagaaagagtatattccaatgcatccctgctgr tccagaatgtc 4 20 
ProAlaTy r Se rGlyAcgGluArgValTy rSe cAsnAlaSe r LeuLeuI leGlnAsnVal 

20 <21 aegcaggagga tgcagga tec tacaccttacacatca taaagegacgega tgggac tgga 480 
ThcGlnGluAspAlaGlySerTyrThrLeuHisIlelleLysArgAcgAspGlyThcGly 

461 ggagcaactggacatttcaccttcaccttacacctggagactcccaagccctcca tctcc 54 0 
GlyVaiThrGlyHisPheThrPheThrLeuHisLeuGluThrProLysProSecIleSe r 

25 54 1 agcaccaacttaaatcccaggcaggccatggaggctgtgatcttaacctgtgatcctacg 6 00 
SerSerAsnLeuAsn?roArcGluAlartetGluAlaValIleLeuThrCysAsp?rcAia 

501 actccagccgcaagc ^accag tggtgcatgaa tggtcagagee tcccta tgactcacacc 6 50 
ThrProAlaAJaSerTyrGlnTrpTcpfletAsnGlyGlnSerLeuProHetThrHisArg 

30 fe&i ttccagctgtccaaaaccaacaggaccctctttatatttgctgtcacaaagtatattgca 7 20 
LeuGinLeuSerLysThrAsnArgThrLeuPhellePheGlyValThrLysTyrlleAla 

"721 ggaccctatgaatgtgaaatacggaacccagtgagtgccagccgcagtgacccactcacc 7 30 
GlyProTyrCluCysGluIleArgAsnProValSerAlaSerArgSecAspProValThr 

35 731 ctgaatctcctcccaaagctgtccaagccctacatcacaatcaacaacttaaaccccaca 8 40 
LeuAsnLcuLeuProLysLeuSecLysProTyrlleThrlleAsnAsnLeuAsnProArg 

34 1 gacaataaggatgtcttaaccttcacctgtgaacctaagaqtgagaactacacctaca *t 900 
GluAsnLysAspValLeuThrPheThcCysGluProLysSerGluAsnTyrThrTyr I le 

40 901 tggtggctaaatggtcagagcctccctgtcagtcccagggtaaagcgacccattgaaaac 9 60 
Trp7:pLeuAsnGlyG : lnSerLeu?roValSerPccAcgValLysArgPL OI leGluAsn 

96 1 ocsatcctcattctacccaatgtcacgagaaatgaaacaggaccttatc-aatgtgaaata 1020 
Arc: leLeuI leLeu? coAsnVa lTh cA r gAsnGluThrGlyP roTy rGlnCysGlu I le 

45 1021 cgggaccgatatggtggcatccgcagtgacccagtcaccctgaatgtcctctatagtcca 108 0 
ArgAspAcgTycGlyGlylleArgSerAspProValThrLeuAsnValLeuTyrGlyPro 
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( 



1081 gac::cccceccat:tacccttcattcacctattaccgttcaggagaaaacctctacttt 1K0 
AspLeuPrcSer IleTy r ProSerPh'.ThrTy rTy rArgSe rGlyGluAsnLeuTy rPhe 

1141 tcctccttccctcagtctaacccacgggcacaatattcttggacaattaatgggaagttt 1200 
SerCysPheGlyGluSe r AsnProAr gAlaGl r»Ty rSe cTrpThr 1 1 eAsnGiy Ly sPhe 

1201 cacctatca=car£aaacctctctatcccccaaataactacaaagcatagtgggctctat 12 5- 
GlnLeuSe rGlyGlnLys LeuSe rll eProGlnl leThrThr LysHi sSe rGiyLeuTy c 

1251 get :cc :c :c;:cr;aactcagccac tggcaaggaaagc tccaaa tccatcacac tea 2 a 12 20 
AlaCysSe z VaiAccAsriSe rAlaThrGlyLysGluSe rSe r LysSer I leThrValLys 

13 21 gtctctgaccggatattaccctgaattctactagttcctccaattccattttctcccatg 1 3SG 
vaiSe rAspTrp I leLeuP roEnd 



|i81 gaatcacgaagagcaagacccactctgttccagaagccctataatctggaggtggacaac 1440 

1^4 41 tcgatgtaaatttcatgggaaaacccttgtacctgacatgtgagccactcagaactcacc 1500 

1501 aaaatgttcgacaccataacaacagctactcaaactgtaaaccaggataagaagttgatg 1560 

1561 acttcacactgtggacagtttttcaaagatgtcataacaagactccccatcatgacaagg 1620 

1621 ctccaccctctactgtctgctcatgcctgcctctttcacttggcaggataatgcagtcat 1680 

1681 tagaatttcacatgtagtagcttctgagggtaacaacagagtgtcagatatgtcatctca 1740 

1741 acctcaaacttttacgtaacatctcagggaaatgtggctctctccatcttgcatacaggg 1800 

18 01 ctcccaatagaaatgaacacagagatattgcctgtgtgtttgcagagaagatggtttcta 18 60 

1861 taaagagtaggaaagctgaaattatagtagagtctcctttaaatgcacattgtgtggatg 1920 

1921 gctctcaccatttcctaagagatacagtgtaaaaacgtgacagtiaatactgattctagca 1980 

1981 gaataaacatgtaccacatttgcaaaaaa 2010 
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EP 0 346 710 B1 



(5) 



I gggtgga tcr-aggctca tctccataggggagaacacacatacagcagagacca tggga 59 
5 ' KetGly 

5 0 cccctctcaccccctccctgcactcagcaca tcacc tggaaggggc tec tgc tea cage a 119 
ProLeuSe rAla? ro?roCysThrGlnHi s I le ThrTrpLy sGlyLeuLeuLeuTh cAla 

120 tcacttttaaacwtctgcaacctgcccaccactgcccaagtaataattgaagcccagcca ITS 
1Q SerLeuLeuAsnPheTrpAsnLeuProThrThrAlaGlnValllelleGluAlaGlnPro 

180 cccaaagtttctgaggggaaggatgttcttctacttgtccacaatttgccccagaatctt 2 39 
ProLysValSerGluGlyLysAspValLeuLeuLeuValHisAsnLeuProGlnAsnLeu 

2 40 actggctacacccggcacaaagggcaaatgacggacctctaccattacattacatcatat 299 
75 ThrGlyTyrlieTrpTyrLysGlyGlnMetThrAspLeuTy.rHisTyrlleThrSerTyr 

3cb0 gtagtagacggtcaaattatatatgggcctgcctacagtggacgagaaacagtatattcc 355 
ValValAspGlyGlnllelleTyrGlyProAlaTyrSerGlyArgGluThrValTyrSer 



20 



25 



30 



35 



40 



45 



3 60 aatgcatccctgctgatccagaatgtcacacaggaggatgcaggatcctacaccttacac 419 

AsnAlaSerLeuLeuIleGlnAsnValThrGlnGluAspAlaGlySerTyr-ChrLeuHis 

4 20 atcataaagccaggcgatgggactggaggagtaactggatatttcactgtcaccttatac 479 

IlelleLysArgGlyAspGlyThrGlyGlyValThrGlyTyrPheThrValThrLeuTyr 

4 80 tcggagactcccaagcgctccatctccagcagcaacttaaaccccagggaggtcatggag 539 
SerGluThrPrcLysArgSerlleSerSerSerAsnLeuAsnProArgGluValMetGlu 

54 0 gctgtgcgcttaatctgtgatcctgagactccggatgcaagctacctgtggttgctgaat 599 
AlavalArgLeulleCysAspProGluThrProAspAlaSerTyrLeuTrpLeuLeuAsn 

600 ggtcagaacctccctatgactcacaggttgcagctgtccaaaaccaacaggaccctctat 659 
GlyGlnAsnLeuPro.HetThrKisArgLeuGlnLeuSerLysThrAsnArgThrLeuTyr 

£60 ctatttggtctcacaaagtatattgcagggccctatgaatgtgaaatacggaggggagtg 719 
LeuPheGlyValThrLysTyrlleAlaGlyProTyrGluCysGluIleArgArgGlyVal 

720 agtgccagccgcagtgacccagtcaccctgaatctcctcccgaagctgcccatgccttac 779 
SecAlaSerAccSerAspProValThrLeuAsnLeuLeuProLysLeuProHetProTyr 

78 0 atcaccatcaacaacttaaaccccagggagaagaaggatgtgttagccttcacctgtgaa 8 39 
IleThrlleAsnAsnLeuAsnProAcgGluLysLysAspValLeuAlaPheThrCysGlu 

84 0 cctaagagtcggaactacacctacatttggtggctaaatggtcagagcctcccggtcagt 899 
ProLysSerArgAsnTyrThrTyrlleTrpTrpLeuAsnGlyGlnSerLeuProValSer 

900 ccgagggtaaagcgacccattgaaaacaggatactcattctacccagtgtcacgagaaat 959 
ProArgValLysArgProlleGluAsnArglleLeuIleLeuProSerValThrArgAsn 

96 0 gaaacaggaccctatcaatgtgaaatacgggaccgatatggtggcatccgcagtaaccca 1019 
GluThrGlyPcoTyrGlr.CysGluIleArgAspArgTyrGlyGlylleArgSerAsnPro 
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5 



70 



75 



1020 'gtLcaccetcaatctcctctatggtccagacc :ccccacaa tttaccc t tact tea cc tat 107 9 
valThr LeuAsnvalLeuTyrGlyProAspLeuPcoArglleTyrProTyrPheThrTyr 

108 0 taccgttcaccacaaaacctcgacttgtcctcct ttgecgae tctaacccaccggcagag i i 39 
TyrArcSerGlyGluAsnLeuAspLeuSerCysPheAlaAspSerAsnProProAlaGlu 

1140 ta tttttgcacaattaatgggaagtttcagctatcaggacaaaagc^ctttatcccccaa 119 9 
TyrPheTrpThrlleAsnGlyLysPheGlnLeuSerGlyGlnLysLeuPhelleProGln 

120 0 a ttactacaaa tea tagcgggctctatgcttgc tctgttcgtaac tcagccactggcaag 12 3 9 
1 1 eThrTh rAsr.Ki sSe cGly LeuTy rAl aCy sSe rValAr gAsnSe rAlaTh rGl yLy s 

125 0 gaaatctccaaatccatgatagtcaaagtctctggtccctgccatggaaaccagacagag 1219 
GluIleSerLysSerrtetlleValLysValSerGlyProCysHisGlyAsnGlnThrGlu 

1320 tctcattaatcgctgccacaatagagacactgagaaaaagaacaggttgataccttcatg 13 79 
SerHisEnd 

• . 

1380 aaattcaagacaaagaagaaaaaggctcaatgttattggactaaataatcaaaaggataa 1439 

14 40 tgttttcataatttttattggaaaatgtgctgattcttggaatgtttt'attctccagatt 14 S9 

1500 tatgaactttttttcttcagcaattggtaaagtatacttttgtaaacaaaaattgaaaca 1559 

1560 tttgcttttgctctctatctgagtgccccccc 1591 



20 



2. Vehicule de clonage recombinant apte a une replication, comportant un produit d'insertion comprenant 
un acide nucleique selon la revendication 1 . 

25 3. Cellule qui a ete transfectee, infectee par un vehicule de clonage recombinant selon la revendication 2, 
ou a laquelle on a injecte ce dernier. 

4. Proced6 pour preparer un polypeptide, ledit procede comprenant les etapes consistant a : 

(a) cultiver la cellule selon la revendication 3, et 
30 (b) recuperer le polypeptide exprime par ladite cellule. 

5. Procede pour preparer un anticorps dirige contre un polypeptide, ledit procede comprenant les etapes 
consistant a : 

(a) preparer ledit polypeptide par le procede selon la revendication 4, 
35 (b) injecter ledit polypeptide dans un hote capable de produire des anticorps, et 

(c) recuperer lesdits anticorps. 
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TMKCEA(c) ) 



TM2(CEA(e) ) 



TM3(CEA(f) ) 



TMMCEA(g) ) 




FIG.1 
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CAGTTCCATGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
GlnPheHisValTyrProGluLeuProLysProSerlleSerSerAsnAsnSerAsnPro 



550 570 590 

GTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
ValGluAspLysAspAlaValAlaPheThrCysGluProGluThrGlnAspThrThrTyr 

10 

610 630 650 

• • - . 

CTGTGGTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
15 LeuTrpTrpIleAsnAsnGlnSerLeuProValSe rProArgLeuGlnLeuSerAsnGly 



670 690 710 

• • • • , 

AACAGGACCCTCACTCTACTCAGTGTCACAAGGAATGACACAGGACCCTATGAGTGTGAA 
AsnArgThcLeuThrLeuLeuSe rValThrArgAsnAspTh rGlyP roTy rGluCysGlu 



730 750 770 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
IleGlnAsnProValSerAlaAsnArgSerAspProValThrLeuAsnValThrTy rGly 

790 810 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
ProAspThcProThrlleSerProSerAspThcTyrTyrArgProGlyAlaAsnLeuSer 



850 870 890 

35 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATCAATGGAACA 
LeuSerCysTyrAlaAlaSerAsnProProAlaGlnTyrSerTrpLeuIleAsnGlyThr 



40 



910 930 950 

• • • • 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTGGATCC 
PheGlnGlnSerThrGlnGluLeuPhelleProAsnlleThrValAsnAsnSerGlySe r 

970 990 1010 

TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
Ty rThrCysHi sAl a AsnAsnSe c ValTh r Gl yCy sAsnA r gTh rThrValLysThrlle 

50 

1030 1050 1070 
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ATAGTCACTGAGCTAAGTCCAGTAGTAGCAAAGCCCCAAATCAAAGCCAGCAAGACCACA 
IleValThrGluLeuSerProValValAlaLysProGlnlleLysAlaSe cLysThrThr 



1090 1110 1130 

GTCACAGGAGATAAGGACTCTGTGAACCTGACCTGCTCCACAAATGACACTGGAATCTCC 
ValThrGlyAspLysAspSerValAsnLeuThrCysSerThrAsnAspThrGlylleSe r 



1150 1170 1190 

♦ • • ■ • • 

ATCCGTTGGTTCTTCAAAAACCAGAGTCTCCCGTCCTCGGAGAGGATGAAGCTGTCCCAG 
IleArgTrpPhePheLysAshGlnSerLeuProSe rSe rGluArgMe tLysLeuSe rGln 



1210 1230 1250 

• • • . 

GGCAACACCACCCTCAGCATAAACCCTGTCAAGAGGGAGGATGCTGGGACGTATTGGTGT 
GlyAsnThrThrLeuSerlleAsnPL-oValLysArgGluAspAlaGlyThcTyrTrpCys 



• 1270 1290 1310 

• ••*•# 
GAGGTCTTCAACCCAATCAGTAAGAACCAAAGCGACCCCATCATGCTGAACGTAAACTAT 
GluValPheAsnProIleSerLysAsnGlnSerAspProIleMetLeuAsnValAsnTyr 



1330 1350 1370 

• •«... 
AATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGGCCATTGCTGGCATTGTGATTGGA 
AsnAlaLeuProGlnGluAsnGlyLeuSerProGlyAlalleAlaGlylleVallleGly 



1390 1410 1430 . 

• • . - . . 

GTAGTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTGCATTTCGGGAAG 
ValValAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeuHisPheGlyLys 



1450 1470 1490 

• ••».. 
ACCGGCAGGGCAAGCGACCAGCGTGATCTCACACAGCACAAACCCTCAGTCTCCAACCAC 
ThrGlyArgAlaSerAspGlnArgAspLeuThrGluHisLysProSerValSerAsnHis 



1510 1530 1550 

ACTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGAAGTTACTTATTCTACCCTG 
Th rGlnAspHi sSecAsnAspProProAsnLystte tAsnGluValTh cT'y r Se cThcLeu 



1570 



1590 
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AACTTTGAAGCCCAGCAACCCACACAACCAACTTCAGCCTCCCCATCCCTAACAGCCACA 
AsnPheGluAlaGlnGlnProThrGlnProThrSerAlaSerPcoSe cLeuThcAlaThr 



1630 1650 1670 

• ♦ • • . . 

GAAATAATTTATTCAGAAGTAAAAAAGCAGTAATGAAACCTGTCCTGCTCACTGCAGTGC 
GluIlelleTyrSerGluValLysLysGln 

1690 1710 1730 

TGATGTATTTCAAGTCTCTCACCCTCATCACTAGGAGATTCCTTTCCCCTGTAGGGTAGA 

1750 1770 1790 

GGGGTGGGGACAGAAACAACTTTCTCCTACTCTTCCTTCCTAATAGGCATCTCCAGGCTG 

1810 1830 1850 

• • • 

CCTGGTCACTGCCCCTCTCTCAGTGTCAATAGATGAAAGTACATTGGGAGTCTGTAGGAA 

1870 1890 1910 

• * • • • 

ACCCAACCTTCTTGTCATTGAAATTTGGCAAAGCTGACTTTGGGAAAGAGGGACCAGAAC 
1930 1950 1970 

• • • • • • 

TTCCCCTCCCTTCCCCTTTTCCCAACCTGGACTTGTTTTAAACTTGCCTGTTCAGAGCAC 

1990 2010 2030 

TCATTCCTTCCCACCCCCAGTCCTGTCCTATCACTCTAATTCGGATTTGCCATAGCCTTG 

2050 2070 2090 

■ • • • « . 

AGGTTATGTCCTTTTCCATTAAGTACATGTGCCAGGAAACAGCGAGAGAGAGAAAGTAAA 

2110 2130 2150 

CGGCAGTAATGCTTCTCCTATTTCTCCAAAGCCTTGTGTGAACTAGCAAAGAGAAGAAAA 

2170 2190 2210 

TCAAATATATAACCAATAGTGAAATGCCACAGGTTTGTCCACTGTCAGGGTTGTCTACCT 
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2230 2250 2270 

GTAGGATCAGGGTCTAAGCACCTTGGTGCTTAGCTAGAATACCACCTAATCCTTCTGGCA 

2290 2310 2330 

AGCCTGTCTTCAGAGAACCCACTAGAAGCAACTAGGAAAAATCACTTGCCAAAATCCAAG 

2350 2370 2390 

GCAATTCCTGATGGAAAATGCAAAAGCACATATATGTTTTAATATCTTTATGGGCtCTGT 

2410 2430 2450 

• ••••• 

TCAAGGCAGTGCTGAGAGGGAGGGGTTATAGCTTCAGGAGGGAACCAGCTTCTGATAAAC 
2470 2490 2510 

- • 

ACAATCTGCTAGGAACTTGGGAAAGGAATCAGAGAGCTGCaCTTCAGCGATTATTTAAAT 

2530 2550 2570 

TGTTAAAGAATACACAATTTGGGGTATTGGGATTTTTCTCCTTTTCTCTGAGACATTCCA 

2590 2610 2630 

CCATTTTAATTTTTGTAACTGCTTATTTATGTGAAAAGGGTTATTTTTACTTAGCTTAGC 

2650 2670 2690 

TATGTCAGCCAATCCGATTGCCTTAGGTGAAAGAAACCACCGAAATCCCTCAGGTCCCTT 

2710 2730 2750 

GGTCAGGAGCCTCTCAAGATTTTTTTTGTCAGAGGCTCCAAATAGAAAATAAGAAAAGGT 

2770 2790 2810 

• • • . • • 

TTTCTTCATTCATGGCTAGAGCTAGATTTAACTCAGTTTCTAGGCACCTCAGACCAATCA 



2830 2850 2870 

• ••••• 

TCAACTACCATTCTATTCCATGTTTGCACCTGTGCATTTTCTGTTTGCCCCCATTCACTT 



EP 0 346 710 B1 



2890 2910 2930 

TGTCAGCAAACCTTGGCCTCTGCTAAGGTGTATTTGGTCC1TGAGAAGTGGGAGCACCCT 

2950 . 2970 2990 

ACAGGGACACTATCACTCATGCTGGTGGCATTGTTTACAGCTAGAAAGCTGCACTGGTGC 

3010 3030 3050 

TAATGCCCCTTGGGAAATGGGGCTGTGAGGAGGAGGATTATAACTTAGGCCTAGCCTCTT 

3070 3090 3110 

TTAACAGCCTCTGAAATTTATCTTTTCTTCTATGGGGTCTATAAATGTATCTTATAATAA 

3130 3150 3170 

AAAGGAAGGACAGGAGGAAGACAGGCAAATGTACTTCTCACCCAGTCTTCTACACAGATG 

3190 3210 3230 

GAATCTCTTTGGGGCTAAGAGAAAGGTTTTATTCTATATTGCTTACCTGATCTCATGTTA 

3250 3270 3290 

ATACTCACGTACCTCTT 



GGCCTAAGAGCCTTTCTCCAGGAGGATTAGCTTGGAGTTCTCT 



3310 3330 3350 

TCAGGGTTTTCTAACCCTGACACGGACTGTGCATACTTTCCCTCATCCATGCTGTGCTGT 
3370 3390 3no 

CTTATTTA^TTTTTGfcTGGCTAAGATCATGTCTGAATTATGTATGAAAATTATTCTATCT 
3430 34 5 o 

TTTTATAATAAAAATAATATATCAGACATCGAAAAAAAAAA 
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A schematic relationship of the transmembrane CEA's, namely TM-1 (CEA-(c)), TM-2 (CEA-(e)), TM-3 
(CEA-(f)) and TM-4 (CEA-(g)) is depicted in Fig. 1 : 

Assuming TM-1 is composed of five sections as depicted in Fig. 1, namely 10, 12, 14, 16 and 18, TM-2 
differs from TM-1 in that the 100 amino acid (100 AA) section 14 is deleted and at splice point 20 betweeen 
25 sections 12 and 16, surprisingly an extra amino acid, namely Asp occurs. 

TM-3 is the same as TM-1 except that section 18 is truncated at splice point 22, i.e., a section of 70 
amino acids is deleted and results in a new section made up of subsections 24 + 26. Surprisingly, 
however, six new amino acids (section 26) occur. Another example of formation of a novel amino acid 
sequence resulting from a deletion of nucleic acid sequence is for platelet derived growth factor-A. 
30 TM-4 is the same as TM-2 up until the end of subsection 24. 

Subsection 24 is contained in section 18 of TM-1 and TM-2, but is not depicted in Fig. 1 for TM-1 and 
TM-2. 

Some CEA epitopes are unique. These are the epitopes which have been useful for distinguishing the 
various CEA-like antigens immunologically. Peptide epitopes are defined by the linear amino acid sequence 

35 of the antigen and/or features resulting from protein folding. The information required for protein folding is 
encoded in the primary amino acid sequence. Therefore, antigenic differences ultimately result from 
differences in the primary structure of the different CEA molecules. The differences residing in the CEA 
protein in the CEA species can thus be determined by determining the primary amino acid sequences. This 
can be most readily accomplished by cloning and sequencing each of the genes for CEA. To determine 

40 which gene products will be most useful for cancer diagnosis, unique probes can be selected for each gene 
and expression of each gene can be determined in different tumor types by nucleic acid hybridization 
techniques. The present invention provides a tool with which to identify potential genes coding for different 
members of the CEA family and to determine the theoretical primary amino acid sequences for them. Using 
the method of automated peptide synthesis, peptides can then be synthesized corresponding to unique 

45 sequences in these antigens. With these peptides, antibodies to these sequences can be produced which, 
in the intact CEA molecule, might not be recognized by the animal being immunized. Having accomplished 
this, advantage can then be taken of the differences in these antigens to generate specific immunoassays 
for the measurement of each antigen. 

A wide variety of host/cloning vehicle combinations may be employed in cloning the double-stranded 

so nucleic acid prepared in accordance with this invention. For example, useful cloning vehicles may consist of 
segments of chromosomal, non-chromosomal and synthetic DNA sequences, such as various known 
derivatives of SV40 and known bacterial plasmids, e.g., plasmids from E. coli including col E1, pCR1, 
pBR322, pMB89 and their derivatives, wider host range plasmids, e.g., RP4, and phage DNAs, e.g., the 
numerous derivatives of phage, e.g., NM989, and other DNA phages, e.g., M13 and Filamenteous single- 

55 stranded DNA phages and vectors derived from combinations of plasmids and phage DNAs such as 
plasmids which have been modified to employ phage DNA or other expression control sequences or yeast 
plasmids such as the 2 u plasmid or derivatives thereof. Useful hosts may include bacterial hosts such as 
strains of E. coli, such as E. coli HB 101, E. coli X1776, E. coli X2282, E. coli MRCI and strains of 
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Pseudomonas, Bacillus subtilis, Bacillus stearothermophilus and other E. coli, bacilli, yeasts and other fungi, 
animal or plant hosts such as animal (including human) or plant cells in culture or other hosts. Of course, 
not all host/vector combinations may be equally efficient. The particular selection of host/cloning vehicle 
combination may be made by those of skill in the art after due consideration of the principles sat forth 

5 without departing from the scope of this invention. 

Furthermore, within each specific cloning vehicle, various sites may be selected for insertion of the 
nucleic acid according to the present invention. These sites are usually designated by the restriction 
endonuclease which cuts them. For example, in pBR322 the Pstl site is located in the gene for beta- 
lactamase, between the nucleotide triplets that code for amino acids 181 and 182 of that protein. One of the 

70 two Hindll endonuclease recognition sites is between the triplets coding for amino acids 101 and 102 and 
one of the several Taq sites at the triplet coding for amino acid 45 of beta-lactamase in pBR322. In similar 
fashion, the EcoRI site and the PVUII site in this plasmid lie outside of any coding region, the EcoRI site 
being located between the genes coding for resistance to tetracycline and ampicillin, respectively. These 
sites are well recognized by those of skill in the art. It is, of course, to be understood that a cloning vehicle 

75 useful in this invention need not have a restriction endonuclease site for insertion of the chosen DNA 
fragment. Instead, the vehicle could be cut and joined to the fragment by alternative means. 

The vector or cloning vehicle and in particular the site chosen therein for attachment of a selected 
nucleic acid fragment to form a recombinant nucleic acid molecule is determined by a variety of factors, 
e.g., the number of sites susceptible to a particular restriction enzyme, the size of the protein to be 

20 expressed, the susceptibility of the desired protein to proteolytic degradation by host cell enzymes, the 
contamination of the protein to be expressed by host cell proteins difficult to remove during purification, the 
expression characteristics, such as the location of start and stop codons relative to the vector sequences, 
and other factors recognized by those of skill in the art. The choice of a vector and an insertion site for a 
particular gene is determined by a balance of these factors, not all sections being equally effective for a 

25 given case. 

Methods of inserting nucleic acid sequences into cloning vehicles to form recombinant nucleic acid 
molecules include, for example, dA-dT tailing, direct ligation, synthetic linkers, exonuclease and 
polymerase-linked repair reactions followed by ligation, or extension of the nucleic acid strand with an 
appropriate polymerase and an appropriate single-stranded template followed by ligation. 

30 It should also be understood that the nucleotide sequences or nucleic acid fragments inserted at the 
selected site of the cloning vehicle may include nucleotides which are not part of the actual structural gene 
for the desired polypeptide or mature protein or may include only a fragment of the complete structural 
gene for the desired protein or mature protein. 

The cloning vehicle or vector containing the foreign gene is employed to transform an appropriate host 

35 so as to permit that host to replicate the foreign gene and to express the protein coded by the foreign gene 
or portion thereof. The selection of an appropriate host is also controlled by a number of factors recognized 
by the art. These include, for example, the compatibility with the chosen vector, the toxicity of proteins 
encoded by the hybrid plasmid, the ease of recovery of the desired protein, the expression characteristics, 
biosafety and costs. A balance of these factors must be struck with the understanding that not all hosts may 

40 be equally effective for expression of a particular recombinant DNA molecule. 

The level of production of a protein is governed by two major factors: the number of copies of its gene 
within the cell and the efficiency with which those gene copies are transcribed and translated. Efficiency of 
transcription and translation (which together comprise expression) is in turn dependent upon nucleotide 
sequences, normally situated ahead of the desired coding sequence. These nucleotide sequences or 

4$ expression control sequences define inter alia, the location at which RNA polymerase interacts to initiate 
transcription (the promoter sequence) and aTwhich ribosomes bind and interact with the mRNA (the product 
of transcription) to initiate translation. Not all such expression control sequences function with equal 
efficiency. It is thus of advantage to separate the specific coding sequences for the desired protein from 
their adjacent nucleotide sequences and fuse them instead to other known expression control sequences so 

so as to favor higher levels of expression. This having been achieved, the newly engineered nucleic acid, e.g., 
DNA, fragment may be inserted into a multicopy plasmid or a bacteriophage derivative in order to increase 
the number of gene copies within the cell and thereby further improve the yield of expressed protein. 

Several expression control sequences may be employed as described above. These include the 
operator, promoter and ribosome binding and interaction sequences (including sequences such as the 

55 Shine-Dalgarno sequences) of the lactose operon of E. coli ("the lac system"), the corresponding 
sequences of the tryptophan synthetase system of E. coli ("the trp system"), the major operator and 
promoter regions of phage X (O l Pl and OrP'r), the control region of Filamenteous single-stranded DNA 
phages, or other sequences which control the expression of genes of prokaryotic or eukaryotic cells and 
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their viruses. Therefore, to improve the production of a particular polypeptide in an appropriate host, the 
gene coding for that polypeptide may be selected and removed from a recombinant nucleic acid molecule 
containing it and reinserted into a recombinant nucleic acid molecule closer or in a more appropriate 
relationship to its former expression control sequence or under the control of one of the above described 
5 expression control sequences. Such methods are known in the art. 

As used herein "relationship" may encompass many factors, e.g., the distance separating the expres- 
sion enhancing and promoting regions of the recombinant nucleic acid molecule and the inserted nucleic 
acid sequence, the transcription and translation characteristics of the inserted nucleic acid sequence or 
other sequences in the vector itself, the particular nucleotide sequence of the inserted nucleic acid 
io sequence and other sequences of the vector and the particular characteristics of the expression enhancing 
and promoting regions of the vector. 

Further increases in the cellular yield of the desired products depend upon an increase in the number 
of genes that can be utilized in the cell. This is achieved, for illustration purposes, by insertion of 
recombinant nucleic acid molecules engineered into the temperate bacteriophage X (NM989), most simply 
75 by digestion of the plasmid with a restriction enzyme, to give a linear molecule which is then mixed with a 
restricted phage X cloning vehicle (e.g., of the type described by N. E. Murray et al, "Lambdoid Phages 
That Simplify the Recovery of In Vitro Recombinants", Molec. Gen. Genet., 150, pp. 53-61 (1977) and N. E. 
Murray et al, "Molecular Cloning of the DNA Ligase Gene From Bacteriophage T4", J. Mol. Biol., 132, pp. 
493-505 (1979)) and the recombinant DNA molecule recircularized by incubation with DNA ligaieT The 
20 desired recombinant phage is then selected as before and used to lysogenize a host strain of E. coli. 

Particularly useful X cloning vehicles contain a temperature-sensitive mutation in the repression gene cl 
and suppressible mutations in gene S, the product of which is necessary for lysis of the host cell, and gene 
E, the product of which is major capsid protein of the virus. With this system, the lysogenic cells are grown 
at 32 °C and then heated to 45 °C to induce excision of the prophage. Prolonged growth at 37 °C leads to 
25 high levels of production of the protein, which is retained within the cells, since these are not lysed by 
phage gene products in the normal way, and since the phage gene insert is not encapsulated it remains 
available for further transcription. Artificial lysis of the cells then releases the desired product in high yield. 

In addition, it should be understood that the yield of polypeptides prepared in accordance with this 
invention may also be improved by substituting different codons for some or all of the codons of the present 
30 DNA sequences, these substituted codons coding for amino acids identical to those coded for by the 
codons replaced. 

Finally, the activity of the polypeptides produced by the recombinant nucleic acid molecules of this 
invention may be improved by fragmenting, modifying or derivatizing the nucleic acid sequences or 
polypeptides of this invention by well-known means, without departing from the scope of this invention. 
35 The polypeptides of the present invention include the following: 

(1) the polypeptides expressed by the above described cells, 

(2) polypeptides prepared by synthetic means, 

(3) fragments of polypeptides (1) or (2) above, such fragments produced by synthesis of amino acids or 
by digestion or cleavage. 

40 Regarding the synthetic peptides according to the invention, chemical synthesis of peptides is 
described in the following publications: S.B.H. Kent, Biomedical Polymers, eds. Goldberg, E.P. and 
Nakajima, A. (Academic Press, New York), 213-242, (1980); A.R. Mitchell, S.B.H. Kent, M. Engelhard and 
R.B. Merrifield, J. Org. Chem. , 43, 2845-2852, (1978); J.P. Tarn, T.-W. Wong, M. Riemen, F.-S. Tjoeng and 
R.B. Merrifield, Tet. Letters , 4033-4036, (1979); S. Mojsov, A.R. Mitchell and R.B. Merrifield, J. Org. Chem., 

45 45, 555-560, (1980); J.P. Tarn, R.D. DiMarchi and R.B. Merrifield, Tet. Letters, 2851-2854, (1981); and 
S.B.H. Kent, M. Riemen, M. Le Doux and R.B. Merrifield, Proceedings of the IV International Symposium on 
Methods of Protein Sequence Analysis, (Brookhaven Press, Brookhaven, NY), in press, 1981. 

In the Merrifield solid phase procedure, the appropriate sequence of L-amino acids is built up from the 
carboxyl terminal amino acid to the amino terminal amino acid. Starting with the appropriate carboxyl 

so terminal amino acid attached to a polystyrene (or other appropriate) resin via chemical linkage to a 
chloromethyl group, benzhydrylamine group, or other reactive group of the resin, amino acids are added 
one by one using the following procedure. The peptide-resin is: 

(a) washed with methylene chloride; 

(b) neutralized by making for 10 minutes at room temperature with 5% (v/v) diisopropylethylamine (or 
55 other hindered base) in methylene chloride; 

(c) washed with methylene chloride; 

(d) an amount of amino acid equal to six times the molar amount of the growing peptide chain is 
activated by combining it with one-half as many moles of a carbodiimide (e.g., dicyclohexylcarbodiimide, 
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or diisopropylcarbodiimide) for ten minutes at 0°C, to form the symmetric anhydride of the amino acid. 
The amino acid used should be provided originally as the N-alpha-tert-butyloxycarbonyl derivative, with 
side chains protected with benzyl esters (e.g., aspartic or glutamic acids), benzyl ethers (e.g., serine, 
threonine, cysteine or tyrosine), benzyloxycarbonyl groups (e.g., lysine) or other protecting groups 
5 commonly used in peptide synthesis; 

(e) the activated amino acid is reacted with the peptide-resin for two hours at room temperature, resulting 
in addition of the new amino acid to the end of the growing peptide chain; 

(f) the peptide-resin is washed with methylene chloride; 

(g) the N-alpha-(tert.-butyloxycarbonyl) group is removed from the most recently added amino acid by 
70 reacting with 30 to 65%, preferably 50% (v/v) trifluoroacetic acid in methylene chloride for 10 to 30 

minutes at room temperature; 

(h) the peptide-resin is washed with methylene chloride; 

(i) steps (a) through (h) are repeated until the required peptide sequence has been constructed. 

The peptide is then removed from the resin and simultaneously the side-chain protecting groups are 
75 removed, by reaction with anhydrous hydrofluoric acid containing 10% v/v of anisole or other suitable 
(aromatic) scavenger. Subsequently, the peptide can be purified by gel filtration, ion exchange, high 
pressure liquid chromatography, or other suitable means. 

In some cases, chemical synthesis can be carried out without the solid phase resin, in which case the 
synthetic reactions are performed entirely in solution. The reactions are similar and well known in the art, 
20 and the final product is essentially identical. 

Digestion of the polypeptide can be accomplished by using proteolytic enzymes, especially those 
enzymes whose substrate specificity results in cleavage of the polypeptide at sites immediately adjacent to 
the desired sequence of amino acids. 

Cleavage of the polypeptide can be accomplished by chemical means. Particular bonds between amino 
25 acids can be cleaved by reaction with specific reagents. Examples include the following: bonds involving 
methionine are cleaved by cyanogen bromide; asparaginyl-glycine bonds are cleaved by hydroxylamine. 
The present invention has the following advantages: 

(1) The nucleic acids coding for TM-1, TM-2 and TM-3 can be used as probes to isolate other members 
of the CEA gene family. 

30 (2) The nucleic acids coding for TM-1, TM-2 and TM-3 can be used to derive oligonucleotide probes to 
determine the expression of TM-1, TM-2, TM-3 and other CEA genes in various tumor types. 

(3) TM-1, TM-2, TM-3 and TM-4 nucleotide sequences can be used to predict the primary amino acid 
sequence of the protein for production of synthetic peptides. 

(4) Synthetic peptides derived from the above sequences can be used to produce sequence-specific 
35 antibodies. 

(5) Immunoassays for each member of the CEA antigen family can be produced with these sequence- 
specific antibodies and synthetic peptides. 

(6) The aforementioned immunoassays can be used as diagnostics for different types of cancer if it is 
determined that different members of the CEA family are clinically useful markers for different types of 

40 cancer. 

Peptides according to the present invention can be labelled by conventional means using radioactive 
moieties (e.g., 125 1), enzymes, dyed and fluorescent compounds, just to name a few. 

Several possible configurations for immunoassays according to the present invention can be used. The 
readout systems capable of being employed in these assays are numerous and non-limiting examples of 
45 such systems include fluorescent and colorimetric enzyme systems, radioisotopic labelling and detection 
and chemiluminescent systems. Two examples of immunoassay methods are as follows: 

(1) An enzyme linked immunoassay (ELISA) using an antibody preparation according to the present 
invention (including Fab or F(ab) 1 fragments derived therefrom) to a solid phase (such as a microtiter 
plate or latex beads) is attached a purified antibody of a specificity other than that which is conjugated to 
so the enzyme. This solid phase antibody is contacted with the sample containing CEA antigen family 
members. After washing, the solid phase antibody-antigen complex is contacted with the conjugated anti- 
peptide antibody (or conjugated fragment). After washing away unbound conjugate, color or fluorescence 
is developed by adding a chromogenic or fluorogenic substrate for the enzyme. The amount of color or 
fluorescence developed is proportional to the amount of antigen in the sample. 
55 (2) A competitive fluorometric immunoassay using fluorescently labelled peptide or synthetic peptides of 
the sequences for TM-2, TM-2, TM-3 and TM-4. In this example, the purified peptide expressed by cells 
or synthetic peptides thereof are fluorescently labelled. To a solid phase is attached a purified antibody. 
This solid phase is then contacted with sample containing CEA antigen family members to which has 
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been added fluorescent peptide probe. After binding, excess probe is washed away the amount of bound 
probe is quantitated. The amount of bound fluorescent probe will be inversely proportional to the amount 
of antigen in the sample. 

In the nucleic acid hybridization method according to the present invention, the nucleic acid probe is 
conjugated with a label, for example, an enzyme, a fluorophore, a radioisotope, a chemiluminescent 
compound, etc. In the most general case, the probe would be contacted with the sample and the presence 
of any hybridizable nucleic acid sequence would be detected by developing in the presence of a 
chromogenic enzyme substrate, detection of the fluorophore by epifluorescence, by autoradiography of the 
radioisotopically labelled probe or by chemiluminescence. The detection of hybridizable RNA sequences 
can be accomplished by (1) a dot blot methodology or (2) an in situ hybridization methodology. Methods for 
these last two techniques are described by D. Gillespie and J. Bresser, "mRNA Immobilization in Nal: Quick 
Blots", Biotechniques , 184-192, November/December 1983 and J. Lawrence and R. Singer, "Intracellular 
Localization of Messenger RNAs for Cytosketal Proteins", Cell, 45, 407-415, May 9, 1986, respectively. The 
readout systems can be the same as described above, e.g., enzyrne labelling, radiolabelling, etc. 

As stated above, the invention also relates to the use in medicine of the aforementioned complex of the 
invention. 

The invention further provides a pharmaceutical composition containing as an active ingredient a 
complex of the invention in the form of a sterile and/or physiologically isotonic aqueous solution. 

For parenteral administration, solutions and emulsions containing as an active ingredient the complex of 
the invention should be sterile and, if appropriate, blood-isotonic. 

It is envisaged that the active complex will be administered perorally, parenterally (for example, 
intramuscularly, intraperitoneally, or intravenously), rectally or locally. 

Example 1: Preparation of cDNA in pcE22 which codes for TM2-CEA [CEA-(e)] 

25 — — — 

Example 1a: RNA Preparation 

Messenger RNA was prepared by the proteinase K extraction method of J. Favolaro, R. Treisman and 
R. Kamen, Methods in Enzymology , 65, 718, Academic Press, Inc. (1980), followed by oligo dT cellulose 

30 chromatography to yield poly A+ RNA (3 f -polyadenylated eukaryotic RNA containing most mRNA se- 
quences that can be translated into polypeptides). To obtain approximately 100 ug of poly A+ RNA, 
approximately 3 x 10 8 cells of transfectant 23.411 (ATCC No. CRL 9731, deposited with the ATCC on June 
1, 1988), that expresses TM-1, TM-2, TM-3 and TM-4, Kamarck et al, Proc. Natl. Acad. Sci., USA , 84, 5350- 
5354, August 1987, were harvested from roller bottles after late logarithmic growth. Cells wereTysed by 

35 homogenization in an ice-cold solution of 140 mM NaCI, 1.5 mM MgCb, 10 mM Tris-HCI, pH 8.0, 0.5% 
NP40®, 4 mM dithiothreitol and 20 units of placental ribonuclease inhibitor/ml. Sodium deoxycholate was 
then added to 0.2%. Cytoplasm and nuclei were separated by centrifugation of the homogenate at 12,000xg 
for 20 minutes. The cytoplasmic fraction was mixed with an equal volume of 0.2 M Tris-HCI, pH 7.8, 25 mM 
EDTA, 0.3 M NaCI, 2% sodium dodecyl sulfate and 400 ug/ml of proteinase K, incubated for 1 hour at 

40 37 °C, then extracted once with an equal volume of phenol/cholorform (1:1/v:v) solution. Nucleic acids were 
obtained by ethanol precipitation of the separated aqueous phase. Total RNA was enriched by passage in 
0.5 M NaCI, 10 mM Tris-HCI, pH 7.8, 0.1% sarcosyl® through an oligo dT(12-18) cellulose column. After 
washing, bound RNA was eluted in the same solution without sodium chloride. 

45 Example 1b: Reverse Transcription of mRNA 

Ten micrograms of poly A+ RNA were primed for reverse transcription with oligo dT(12-18) and pdN s 
primers. One hundred microliter reaction was performed for 4 hours at 42 • C with 200 units AMV reverse 
transcriptase (Life Science, Inc. St. Petersburg, Florida, U.S.A.). The RNA component of the cDNA/mRNA 
so hybrids was replaced with the second complementary strand by treatment with RNase H, E. coli DNA 
polymerase I and E. coli DNA ligase at 12° C and 22 *C for 1.5 hours each. Molecular ends were~polished 
by treatment with T4 DNA polymerase. cDNA was phenol/chloroform extracted and purified over a 
"SEPHADEX® G-50" spun column prepared in 10 mM Tris-HCI, pH 7.8, 1 mM EDTA (TE). 

55 Example 1c: Cloning of pcE22 (plasmid cDNA E22) 



Synthetic DNA linkers 5' pCCCGGG 3' 

3 f GGGCCCTTAA 5' 
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were attached to the ends of cDNA by blunt end ligation with excess T4 DNA ligase. Excess linkers were 
removed by chromatography through "SEPHADEX® G-50" (medium) in TE, and by fractionation on 0.8% 
low melting agarose gel. Based on Northern blot analysis of poly A+ RNA of the 23.411 cell line, the size 
of the CEA-related mRNA was estimated at 3.6 kb. Therefore, cDNA fragments between 2 and 4 kb were 
5 recovered from gel slices and fragments were ethanol precipitated. After resuspension of cDNA in TE, 
EcoRI-cleaved lambda gt10 arms were added to cDNA at an estimated molar ratio of 1:1. Ligation 
proceeded at 7 • C for 2 days in the presence of T4 DNA ligase. Aliquots of the ligation reaction were added 
to commercially-obtained packaging mix (Stratagene, San Diego, California, U.S.A.). Five million phage 
particles were obtained ofter in vitro packaging and infection of E. coli host NM514. 

10 

Example 1d: Screening of Recombinant Library 

Five hundred thousand packaged lambda particles were plated on lawns of E. coli NM514 and replicate 
patterns were lifted onto nitrocellulose sheets as described by W.D. Benton and R.W. Davis, Science 196, 
75 180-182, (1977). Positive phage were selected by hybridization with 32 P-labeled LV7 cDNA insert probelhat 
contained a domain repeated amoung various CEA family members. By multiple rounds of screening. 
Phage from individual plaques were amplified and titered, and these were used to prepare small quantities 
of recombinant phage DNA. 

20 Example 1e: DNA Manipulation 

Phage DNA was prepared according to T. Maniatis, E. Fritsch and J. Sambrook, Molecular Cloning, A 
Laboratory Manual, Cold Spring Habor, (1982). DNA segments were isolated from low melting agarose gels 
and inserted for subcloning into Bluescript plasmid vectors (Stratagene, San Diego, California, U.S.A.). DNA 
25 sequencing was performed by the dideoxy termination method of F. Sanger, S. Nicklen and A. Coulson, 
Proc. Natl. Acad. Sci., U.S.A. , 74, 5463-5467, (1977). The nucleic acid and translated sequence for cDNA in 
pcE22 is given hereinabove (Tj\/T-2 (CEA-(e)). 

Example 2: Preparation of cDNA in pcHT-6 which Partically Codes for TM3-CEA [CEA-(f)] 
30 ~~ ' " " 

Example 2a: RNA Preparation 

Messenger RNA was prepared by the proteinase K extraction method of J. Favolaro, R. Treisman and 
R. Kamen, Methods in Enzymology, 65 718, Academic Press, Inc. (1980), followed by oligo dT cellulose 

35 chromatography to yield poly A+ RNA (3'-polyadenylated eukaryotic RNA containing most mRNA se- 
quences that can be translated into polypeptides). To obtain approximately 100 ug of poly A+ RNA, 
approximately 3 x 10 8 cells of HT-29 tumor cells (ATCC HTB38) were harvested form roller bottles after late 
logarithmic growth. Cells were lysed by homogenization in an ice-cold solution of 140 mM NaCI, 1.5 mM 
MgCI 2 , 10 mM Tris-HCI, pH 8.0, 0.5% NP40®, 4 mM dithiothreitol and 20 units of placental ribonuclease 

40 inhibitor/ml. Sodium deoxycholate was then added to 0.2%. Cytoplasm and nuclei were separated by 
centrifugation of the homogenate at 12,000xg for 20 minutes. The cytoplasmic fraction was mixed with an 
equal volume of 0.2 M Tris-Hcl, pH 7.8, 25 mM EDTA, 0.3 M NaCI, 2% sodium dodecyl sulfate and 400 
ug/ml of proteinase K, incubated for 1 hour at 37 W C, then extracted once with an equal volume of 
phenol/cholorform (1:1 /v:v) solution. Nucleic acids were obtained by ethanol precipitation of the separated 

45 aqueous phase. Total RNA was enriched by passage in 0.5 M NaCI, 10 mM Tris-HCI, pH 7.8, 0.1% 
sarcosyl® through an oligo dT(12-18) cellulose column. After washing, bound RNA was eluted in the same 
solution without sodium chloride. 

Example 2b: Reverse Transcription of mRNA 

so 

Ten micrograms of HT-29 poly A+ RNA were primed for reverse transcription with oligo dT(12-18) and 
pdN6 primers. One hundred microliter reaction was performed for 4 hours at 42 *C with 200 units AMV 
reverse transcriptase (Life Science, Inc. Si Petersburg, Florida, U.S.A.). The RNA component of the 
cDNA/mRNA hybrids was replaced with the second complementary strand by treatment with RNase H, E. 
55 coli DNA polymerase I and E. coli DNA ligase at 12 °C and 22 °C for 1.5 hours each. Molecular ends were 
polished by treatment with T4 DNA polymerase. cDNA was phenol/chloroform extracted and purified over a 
"SEPHADEX® G-50" spun column prepared in 10 mM Tris-HCI, pH 7.8, 1 mM EDTA (TE). 
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Example 2c: Cloning of pcHT-6 (plasmid cDNA HT-6) 

Synthetic DNA linkers 5' pCCCGGG 3' 

3' GGGCCCTTAA 5' 

5 were attached to the ends of cDNA by blunt end ligation with excess T4 DNA ligase. Excess linkers were 
removed by chromatography through "SEPHADEX® G-50" (medium) in TE, and by fractionation on 0.8% 
low melting agarose gel. Based on Northern blot analysis of poly A+ RNA of the HT-29 cell line, the size of 
the CEA-related mRNA was estimated at 2.2 kb. Therefore, cDNA fragments between 2 and 3 kb were 
recovered from gel slices and fragments were ethanol precipitated. After resuspension of cDNA in TE, 

10 EcoRI-cleaved lambda gt10 arms were added to cDNA at an estimated molar ratio of 1:1. Ligation 
proceeded at 7 • C for 2 days in the presence of T4 DNA ligase. Aliquots of the ligation reaction were added 
to commercially-obtained packaging mix (Stratagene, San Diego, California, U.S.A.). Five million phage 
particles were obtained ofter in vitro packaging and infection of E. coli host NM514. 

15 Example 2d : Screening of Recombinant Library 

Five hundred thousand packaged lambda particles were plated on lawns of E. coli NM514 and replicate 
patterns were lifted onto nitrocellulose sheets as described by W.D. Benton and~R~!w. Davis, Science, 196, 
180-182, (1977). Positive phage were selected by hybridization with 32 P-labeled LV7 cDNA insert probelhat 
20 contained a domain repeated amoung various CEA family members. By multiple rounds of screening. 
Phage from individual plaques were amplified and titered, and these were used to prepare small quantities 
of recombinant phage DNA. 

Example 2e: DNA Manipulation 

25 ' 

Phage DNA was prepared according to T. Maniatis, E. Fritsch and J. Sambrook, Molecular Cloning, A 
Laboratory Manual , Cold Spring Habor, (1982). DNA segments were isolated from low melting agarose geii 
and inserted for subcloning into Bluescript plasmid vectors (Stratagene, San Diego, California, U.S.A.). DNA 
sequencing was performed by the dideoxy termination method of F. Sanger, S. Nicklen and A. Coulson, 

30 Proc. Natl. Acad. Sci., U.S.A. , 74, 5463-5467, (1977). The nucleic acid and translated sequence for cDNA in 
HT-6 not complete at the 5' end of its coding region, but the nucleotide sequenece and restriction map of 
the HT-6 insert indicates that it is related to nucleic acid sequences of cDNA clones coding for CEA-(c) and 
CEA-(e). The nucleotide sequence of HT-6 insert differs from these clones at only nucleotide position 1463 
to 1515 and 1676 to 2429 of the CEA-(c) cDNA. It is inferred from this result that the pcHT-6 insert is a 

35 partial coding sequence for CEA-(f) and the presumed nucleic acid and translated sequence of CEA-(f) is 
given hereinabove (TM-3 (CEA-(f)). 

Example 3 : Preparation of cDNA which codes for TM4-CEA [CEA-(g)] 

40 Example 3a: RNA Preparation 

Messenger RNA is prepared by the proteinase K extraction method of J. Favolaro, R. Treisman and R. 
Kamen, Methos in Enzymology , 65, 718, Academic Press, Inc. (1980), followed by oligo dT cellulose 
chromatography to yield poly A+H RNA (3 f -polyadenylated eukaryotic RNA containing most mRNA se- 

45 quences that can be translated into polypeptides). To obtain approximately 100 ug of poly A+ RNA, 
approximately 3 x 10 s cells of transfectant 23.411 or tumor cell line HT-29 (ATCC HTB 38) are harvested 
from roller bottles after late logarithmic growth. Cells are lysed by homogenization in an ice-cold solution of 
140 mM NaCI, 1.5 mM MgCI 2 , 10 mM Tris-HCI, pH 8.0, 0.5% NP40®, 4 mM dithiothreitol and 20 units of 
placental ribonuclease inhibitor/ml. Sodium deoxycholate was then added to 0.2%. Cytoplasm and nuclei 

50 are separated by centrifugation of the homogenate at 12,000xg for 20 minutes. The cytoplasmic fraction is 
mixed with an equal volume of 0.2 M Tris-Hcl, pH 7.8, 25 mM EDTA, 0.3 M NaCI, 2% sodium dodecyl 
sulfate and 400 ug/ml of proteinase K, incubated for 1 hour at 37 *C, then extracted once with an equal 
volume of phenol/cholorform (1:1/v:v) solution. Nucleic acids are obtained by ethanol precipitation of the 
separated aqueous phase. Total RNA is enriched by passage in 0.5 M NaCI, 10 mM Tris-HCI, pH 7.8, 0.1% 

55 sarcosyl through an oligo dT(12-18) cellulose column. After washing, bound RNA is eluted in the same 
solution without sodium chloride. 
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Example 3b: Reverse Transcription of mRNA 

Ten micrograms of 23.411 or HT 29 poly A+ RNA are primed for reverse transcription with oligo dT(12- 
18) and pdNs primers. One hundred microliter reaction was performed for 4 hours at 42 °C with 200 units 
5 AMV reverse transcriptase (Life Science, Inc. St. Petersburg, Florida, U.S.A.). The RNA component of the 
cDNA/mRNA hybrids is replaced with the second complementary strand by treatment with RNase H, E. coli 
DNA polymerase I and E. coli DNA ligase at 12°C and 22 *C for 1.5 hours each. Molecular ends are 
polished by treatment with T4~DNA polymerase. cDNA is phenol/chloroform extracted and purified over a 
"SEPHADEX® G-50" spun column prepared in 10 mM Tris-HCI, pH 7.8, 1 mM EDTA (YE). 

10 

Example 3c: Cloning of cDNA for CEA-(g) 

Synthetic DNA linkers 5' pCCCGGG 3' 

3' GGGCCCTTAA 5' 

75 are attached to the ends of cDNA by blunt end ligation with excess T4 DNA ligase. Excess linkers are 
removed by chromatography through "SEPHADEX® G-50" (medium) in TE, and by fractionation on 0.8% 
low melting agarose gel. Based on Northern blot analysis of poly A+ RNA of the 23.411 and HT-29 cell 
lines, the size of the CEA-related mRNA is estimated at 1.7 kb. Therefore, cDNA fragments between 1 and 
2 kb are recovered from gel slices and fragments are ethanol precipitated. After resuspension of cDNA in 

20 TE, EcoRI-cleaved lambda gt10 arms are added to cDNA at an estimated molar ratio of 1:1. Ligation 
proceeds at 7 • C for 2 days in the presence of T4 DNA ligase. Aliquots of the ligation reaction are added to 
commercially-obtained packaging mix (Stratagene, San Diego, California, U.S.A.). Phage particles are 
obtained after in vitro packaging and infection of E. coli host NM514. 

25 Example 3d: Screening of Recombinant Library 

Five hundred thousand to one million packaged lambda particles are plated on lawns of E. coli NM514 
and replicate patterns are lifted onto nitrocellulose sheets as described by W.D. Benton and RiW. Davis, 
Science, 196, 180-182, (1977). Positive phage are selected by hybridization with ^P-labeled LV7 cDNA 
30 insert probe" that contained a domain repeated amoung various CEA family members. By this selection 
method, positive phage are obtained after multiple rounds of screening. Phage from individual plaques are 
amplified and titered, and these are used to prepare small quantities of recombinant phage DNA. 

Example 3e: DNA Manipulation 

35 

Phage DNA is prepared according to T. Maniatis, E. Fritsch and J. Sambrook, Molecular Cloning , A 
Laboratory Manual, Cold Spring Harbor, (1982). DNA segments are isolated from low melting agarose gels 
and inserted for subcloning into Bluescript plasmid vectors (Stratagene, San Diego, California, U.S.A.). DNA 
sequencing is performed by the dideoxy termination method of F. Sanger, S. Nicklen and A. Coulson, Proc. 
40 Natl. Acad. Sci., U.S.A., 74, 5463-5467, (1977). The nucleotide and translated sequence for a cDNA coding 
for CEA-(g) is given hereinabove (TM-4 (CEA-(g)). 

Example 4 : Screening of a KG-1 cDNA Library with 32 P-labelled CEA Probe, LV7 (CEA-(A)) 

45 A segment of cDNA coding for a portion of carcinoembryonic antigen [LV7 or CEA-(a)] was radiolabel- 
ed by random priming and used to detect homologous sequences on filter replicas of a commercial cDNA 
library prepared from KG-1 cells in bacteriophage vector X gt11 (Clontech Laboratories, Inc., Palo Alto, CA., 
U.S.A.). Hybridizations were performed at 68 °C in 2xSSSPE (IxSSPE - 0.15 M NaCI, 0.01 M sodium 
phosphate and 1 mM EDTA, pH 7), 5x Denhardt's solution and 100 ug of denatured salmon sperm DNA per 

so ml, and post-hybridization washes were in 0.2xSSC, 0.25% sodium dodecyl sulfate. 

Positive phage were picked, rescreened to homogeneity, and amplified for production of DNA. cDNA 
inserts were excised from phage DNA with EcoRI endonuclease and subcloned into the EcoRI site of the 
plasmid vector pBluescript KS. DNA sequencing on double-stranded DNA was by the method of Sanger et 
al, supra. The sequences of two different inserts from the KG-1 cDNA library are shown below: 

55 
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pcKGCEAl: 

1 acagcacagc tgacagccg tact caggaagcttc tgga tec taggct tat ctccacagag 6 0 

5 61 gagaacacacaagcagcagagaccatggggcccctctcagcccctccctgcacacacctc 120 

Me tGlyProLeuSe rAlaProProCysThcHi sLeu 

121 atcacttggaagggggtcctgctcacagcatcacttttaaacttctggaatccgcccaca 18 0 
IleThrTrpLysGlyValLeuLeuThrAlaSerLeuLeuAsnPheTrpAsnProProThr 

10 181 actgcccaagtcacgattgaagcccagccacccaaagtttctgaggggaaggatgttctt 24 0 
ThrAlaGlnValThrlleGluAlaGlnProProLysValSecGluGlyLysAspValLeu 

241 ctacttgtccacaatttgccccagaatcttgctggctacatttggtacaaagggcaaatg 300 
LeuLeuValHisAsnLeuProGlnAsnLeuAlaGlyTyrlleTrpTyrLysGlyGlnMet 

15 301 acatacgtctaccattacattacatcatatgtagtagacggttcaaagaattatatatggg 360 
ThrTyrValTyrHisTyrlleThrSerTyrValValAspGlyGlnArgllelletyrGly 

361 cctgcatacagtggaagagaaagagfeatattccaatgcatccctgctgaitccagaatgtc 4 20 
ProAlaTyrSerGlyArgGluArgValTyrSerAsnAlaSerLeuLeuIleGlnAsnVal 

20 421 acgcaggaggatgcaggatcctacaccttacacatcataaagcgacgcgatgggactgga 480 
ThrGlnGluAspAlaGlySerTyrThrLeuHisllelleLysArgArgAspGlyThrGly 

4 81 ggagtaac tgga catttcacc ttcaccttacacctggagactcccaagccctccatc tec 54 0 
GlyValThrGlyHisPheThrPhaThrLeuHisLeuGluThrProLysProSerlleSer 

25 541 agcagcaacttaaatcccagggaggccatggaggctgtgatcttaacctgtgatcctgcg 600 
SerSerAsnLeuAsnProArgGluAlaMetGluAlaVallleLeuThrCysAspProAla 

601 actccagccgcaagctaccagtggtggatgaatggtcagagcctccctatgactcacagg 66 0 
ThrProAlaAlaSerTyrGlnTrpTrpMetAsnGlyGlnSerLeuProMetThrHisArg 

30 t>61 ttgcagctgtccaaaaccaacaggaccctctttatatttggtgtcacaaagtatattgca 720 
LeuGlnLeuSerLysThrAsnArgThrLeuPhellePheGlyValThrLysTyrlleAla 

721 ggaccctatgaatgtgaaatacggaacccagtgagtgccagccgcagtgacccagtcacc 7 80 
GlyProTyrGluCysGluIleArgAsnProValSerAlaSerArgSerAspProValThr 

35 781 ctgaatctcctcccaaagctgtccaagccctacatcacaatcaacaacttaaaccccaga 84 0 
LeuAsnLeuLeuProLysLeuSerLysProTyrlleThrlleAsnAsnLeuAsnProArg 

841 gagaataaggatgtcttaaccttcacctgtgaacctaagagtgagaactacacctacatt 90 0 
GluAsnLysAspValLeuThrPheThrCysGluProLysSerGluAsnTyrThrTyrl le 

40 g 01 tggtggctaaatggtcagagcctccctgtcagtcccagggtaaagcgacccattgaaaac 960 
TrpTrpLeuAsnGlyGlnSerLeuProValSerProArgValLysArgProIleGluAsn 

961 aggatcctcattctacccaa tgtcacgagaaa tgaaacaggacct ta tcaatgtgaaa ta 10 20 
AcglleLeulleLeuProAsnValThcArgAsnGluThcGlyProTyrGlnCysGluIle 

45 1021 cgggaccgatatggtggcatccgcagtgacccagtcaccctgaatgtcctctatggtcca 108 0 
ArgAcpArgTyrGlyGlylleAcgSerAspProValThrLeuAsnValLeuTyrGlyPro 
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1081 gacctccccagcatttacccttcattcacctattaccgttcaggagaaaacctctacttt 1140 
AspLeuPcoSerlleTyrProSerPheThrTyrTyrArgSerGlyGluAsnLeuTyrPhe 

1141 tcctgcttcggtgagtctaacccacgggcacaatattcttggacaattaatgggaagttt 1200 
SecCysPheGlyGluSerAsnProArgAlaGlnTyrSerTrpThrlleAsnGlyLysPhe 

1201 cagctatcaggacaaaagctctctatcccccaaataactacaaagcatagtgggctctat 1260 
GlnLeuSerGlyGlnLysLeuSerlleProGlnlleThrThrLysHisSerGlyLeuTyr 

1261 gcttgctctgtfccgtaactcagccactggcaaggaaagctccaaatccatcacagtcaaa 13 20 
AlaCysSerValArgAsnSerAlaThrGlyLysGluSerSerLysSerlleThryalLys 

1321 gtctctgactggatattaccctgaattctactagttcctccaattccattttctcccatg 1380 
ValSerAspTrplleLeuProEnd 



1"1 gaatcacgaagagcaagacccactctgttccagaagccctataatctggaggtggacaac 144 0 

1**1 tcgatgtaaatttcatgggaaaacccttgtacctgacatgtgagccactcagaactcacc 150 0 

1501 aaaatgttcgacaccataacaacagctactcaaactgtaaaccaggataagaagttgatg 1560 

1561 acttcacactgtggacagtttttcaaagatgtcataacaagactccccatcatgacaagg 1620 

1621 ctccaccctctactgtctgctcatgcctgcctctttcacttggcaggataatgcagtcat 1680 

1681 tagaatttcacatgtagtagcttctgagggtaacaacagagtgtcagatatgtcatctca 1740 

1741 acctcaaacttttacgtaacatctcagggaaatgtggctctctccatcttgcatacaggg 18 00 

18 01 ctcccaatagaaatgaacacagagatattgcctgtgtg tttgcagagaagatggtttcta 1860 

1861 taaagagtaggaaagctgaaattatagtagagtctcctttaaatgcacattgtgtggatg 1920 

1921 gctctcaccatttcctaagagatacagtgtaaaaacgtgacagtaatactgattctagca 1980 

1981 gaataaacatgtaccacatttgcaaaaaa ZU1U 

pcKGCEA2: 

1 gggtggatcctaggctcatctccataggggagaacacacatacagcagaga.ccatggga 59 

MetGly 

60 cccctctcagcccctccctgcactcagcacatcacctggaaggggctcctgctcacagca 119 
ProLeuSerAlaProPcoCysThrGlnHisIleThrTrpLysGlyLeuLeuLeuThrAla 

120 tcacttttaaacttctggaacctgcccaccactgcccaagtaataattgaagcccagcca 179 
SerLeuLeuAsnPheTrpAsnLeuProThrThrAlaGlnValllelleGluAlaGlnPro 

180 cccaaagtttctgaggggaaggatgttcttctacttgtccacaatttgccccagaatctt 2 39 
ProLysValSerGluGlyLysAspValLeuLeuLeuValHisAsnLeuProGlnAsnLeu 

2 40 actggctacatctggtacaaagggcaaatgacggacctctaccattacattacatcatat 239 
ThrGlyTyrlleTrpTyrLysGlyGlnMetThrAspLeuTyrHisTyrlleThrSerTyr 

300 gtagtagacggtcaaattatatatgggcctgcctacagtggacgagaaacagtatattcc 359 
ValValAspGlyGlnllelleTyrGlyProAlaTyrSerGlyArgGluThrValTyrSer 

360 aatgcatccctgctgatccagaatgtcacacagga-ggatgcaggatcctacaccttacac 419 
AsnAlaSerLeuLeuIleGlnAsnvalThrGlnGluAspAlaGlySerTyrThrLeuHis 

4 20 atcataaagcgaggcgatgggactggaggagtaactggatatttcaCtgtcaccttatac 479 
HelleLysArgGlyAspGlyThrGlyGlyValThrGlyTyrPheThrValThrLeuTyr 

480 tcggagactcccaagcgctccatctccagcagcaacttaaaccccagggaggtcatggag 539 
SerGluThrProLysArgSerlleSerSerSerAsnLeuAsnProArgGluValMetGlu 
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54 0 gctgtgcgcttaatctgtgatcctgagactccggatgcaagctacctgtggttgc tgaat 599 
AlaValArgLeuIleCysAspProGluThrPcoAspAlaSe rTyr LeuTcpLeuLeuAsn 

600 ggtcagaacctccctatgactcacaggttgcagctgtccaaaaccaacaggaccctctat 659 
GlyGlnAsnLeuProMetThrHisArgLeuGlnLeuSerLysThrAsnArgThrLeuTyr 

660 ctatttggtgtcacaaagtatattgcagggccctatgaatgtgaaatacggaggggagtg 719 
LeuPheGlyValThrLysTyrlleAlaGlyProTyrGluCysGluIleArgArgGlyVal 

720 agtgccagccgcagtgacccagtcaccctgaatctcctcccgaagctgcccatgccttac 7 79 
SerAlaSerArgSerAspProValThrLeuAsnLeuLeuProLysLeuProMetProTyr 

780 atcaccatcaacaacttaaaccccagggagaagaaggatgtgttagccttcacctgtgaa 839 
IleThrlleAsnAsnLeuAsnProArgGluLysLysAspValLeuAlaPheThrCysGlu 

84 0 cctaagagtcggaactacacctacatttggtggctaaatggtcagagcctcccggtcagt 899 
75 ProLysSerArgAsnT^rThrTyrlleTrpTcpLeuAsnGlyGlnSerLeuProValSec 

900 ccgagggtaaagcgacccattgaaaacaggatactcattctacccagtgtcacgagaaat 9 59 
ProAcgValLysArgProIleGluAsnArglleLeuIleLeuProServalThrAcgAsn 

960 gaaacaggaccctatcaatgtgaaatacgggaccgatatggtggcatccgcagtaaccca 1019 
20 GluThrGlyProTyrGlnCysGluIleArgAspArgTyrGlyGlylleArgSerAsnPro 

1020 gtcaccctgaatgtcctctatggtccagacctccccagaatttacccttacttcacctat 107 9 
ValThrLeuAsnValLeuTyrGlyProAspLeuProArglleTyrProTyrPheThrTyr 



10 



25 



30 



35 



40 



45 



1080 taccgttcaggagaaaacctcgacttgtcctgctttgcggactctaacccaccggcagag 1139 
TyrArgSerGlyGluAsnLeuAspLeuSerCysPheAlaAspSerAsnProProAlaGlu 

114 0 tatttttggacaattaatgggaagtttcagctatcaggacaaaagctctttatcccccaa 119 9 
TyrPheTrpThrlleAsnGlyLysPheGlnLeuSerGlyGlnLysLeuPhelleProGIn 

i j0 attactacaaatcatagcgggctctatgcttgctctgttcgtaactcagccactggcaag 1259 
IleThrThrAsnHisSerGlybeuTyrAlaCysSerValAcgJ^snSerAlaThrGlyLys 

1260 gaaatctccaaatccatgatagtcaaagtctctggtccctgccatggaaaccagacagag 1319 
GluIleSerLysSerMetlleValLysValSerGlyProCysHisGlyAsnGlnThrGlu 

1320 tctcattaatggctgccacaatagagacactgagaaaaagaacaggttgataccttcatg 137 9 
SerHisEnd 

1380 aaattcaagacaaagaagaaaaaggctcaatgttattggactaaataatcaaaaggataa 14 39 

14 40 tgttttcataatttttattggaaaatgtgctgattcttggaatgttttVttctccagatt 14 99 

1500 tatgaactttttttcttcagcaattggtaaagtatacttttgtaaacaaaaattgaaaca 1559 
1560 tttgcttttgctctctatctgagtgccccccc 1591 



It will be appreciated that the instant specification and claims are set forth by way of illustration and not 
limitation and that various modifications and changes may be made without departing from the scope of the 
present invention. 

Claims 

1. A nucleic acid comprising a base sequence which codes for a peptide sequence, characterized in that 
the group nucleic acid is a DNA selected from the following group of five sequences: 
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10 30 50 

CACCCGTCCTCGAAGCGTTCCTGGAGCCCAAGCTCTCCTCCACAGGTGAAGAC AGGGCCA 

5 

70 90 110 

GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
KetGlyHisLeuSe cAlaProLeuHi sArgValAryValProTrpGln 

70 

130 150 170 

GGGCTTCTGCTC ACAGCCTCACTTCTAACCTTCTGGAACC CGCCCACC ACTGCCCAGC'l C 
75 GlyLeuLeuleuThrAlaSe r Le u LeuTh r?heTrpAsn?ro?f oTh rTh r AlaGi nLe-j 



190 210 230 

20 ACTACTGAATCCATGCCATTC.^TGTTGCAGAGGGGAJsGGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSecMe tProPheAsnValAlaGluClyLysGluValLeuLeuLeuValKi s 

250 270 290 

AATCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGCAAC 
25 AsnLeuPcoGlnGlnLeuPheGlyTy rSe r TrpTy rLysGlyGluAr cjValAspGl yAsn 



310 330 350 

30 CGTCAAATTGTAGGATATGCAATAGGAJvCTC/^C AAGCTACCCCAGGGCCCGC/\--.AC AGC 
ArcGlnlleValGlyTyrAlalleGlyThL-GlnGlnAlaThrProGlyProAlaAsnSer 



370 390 410 

35 ...... 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAA.CGTCACCCACAATGAC 
GlyArgGluTh r 1 1 e Ty r P r oAsnAl a S e r Leu Leu 1 1 eGl nAsnVs 1 Th rGl nAsnA s p 



40 *30 450 470 

ACAGGATTCTACACCCTACAAGTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGA 
ThrGlyPheTy rThrLeuGlnVall 1 eLysSe rAspLeuValAsnGluGluAl aTh rGly 

45 



50 



55 



EP 0 346 710 B1 



<90 S10 530 

CAGTTCCATC7ATACCCGCAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
5 GlnPheHisvalTycProGluLeuProLysProSerl leSerSe cAsnAsnSe r AsnP r o 

550 570. 590 

70 GTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
ValGluAspLysAspAlaValAlaPheThrCysGluProGluThcGlnAspThrThrTyr 

610 630 650 

75 • • • • 

CTGTGGTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAA7GGC 
LeuTrpTrpIleAsnAsnGlnSerLeuProValSe rProArgLeuGlnLeuSe rAsnGly 

20 670 690 710 

AACAGGACCCTCACTCTACTCAGTGTCACAAGG AATGACACAGGACCCTATGAGTGTGAA 
AsnAcgThcLeuThcLeuLeuSe r Va lTh r A rgAs nAspTh r Gl y ? coTy rGluCysGlu 

25 7 3 0 7 50 770 

ATACAGAACCCAGTGAGTGCGAACCCCAGTGACCCAGTCACCTTCAATGTCACCTATGGC 
1 1 eGl nAsn? roVa 1 S e r Al aAsnA r gSe c AspP r oVa lTh r Le uAsnVa 1 ThrTyrGly 

30 

79 0 010 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACCTC AGC 
ProAspThrProThr IleSerProSe rAspThrTy rTyrArgProGlyAl aAsnLeuSe r 

35 

850 870 890 

CTCTCCTGCTATCCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATCAATGGAACA 
4° LeuSerCysTyrAlaAlaSe rAsnProProAlaGlnTyrSe r Tip Leu I le'AsnGlyThr 

910 930 950 

45 TTCCAGCA-^AGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAJVTAGTGGATCC 
PheGlnGlnSerThrGlnGluLeuPhelleProAsnlleThrValAsnAsnSerGlySer 

970 990 1010 

tatacctgccacgccaataactcagtcactggctgcaacaggaccacagtcaacacgatc 

TytThrCysHitAUAsnAsnSe cValThrGlyCysAsnAcgThr Th rValLysThr lie 
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1030 1050 1070 

, . . • • • 

ATAGTCACTCATAATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGCCCATTGCTGGC 
1 1 e Va ITh r AspAs nAl a Leu P r oGlnGluAsnGl y Le uSe rPcoGlyAlalleAlaGly 

1090 1110 1130 

ATTGTGATTGGAGTAGTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTT7CTG 
w IleVallleGlyValvalAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeu 

1150 1170 1190 



15 



CATTTCGGGAAGACCGGCAGGGCAAGCGACCAGCGTGATCTCA.CAGAGCACAAACCCTC A 
Hi sPheGlyLysThrGlyArgAlaSe cAspGlnArgAspLeuThrGluHi sLys? r oSe r 

1210 1230 1250 

20 GTCTCCAACCACACTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGAAGTTACT 
ValSerAsnHisThrGlnAspHisSerAsnAspProProAsnLysttetAsnGluValThr 



25 



1270 1290 1310 

TATTCTACCCTGAACTTTGAAGCCCAGCAACCCACAC AACCAACTTCAGCCTCCCC ATCC 
TyrSerThrLeuAsnPheGluAlaGlnGlnProTh cGl nP r oTh c S e rAlaSerProSer 

1330 1350 1370 

CTAACAGCCACAGAAATAATTTATTCAGAAGTAAAAAAGCAGTAATGAAACCTGTCCTGC 
LeuThrAlaThrGluI le I leTyrSe rGluVal LysLysGln 

35 1 39 0 1 410 1 430 

TCACTGCAGTGCTGATCTATTTCA/vGTCTCTCACCCTCATCACTAGGAGATTCCTTTCCC 



30 



40 



45 



50 



1450 1470 1490 

CTGTAGGGTAGAGGGGTGGGGACAGAAACAACTTTCTCCTACTCTTCCTTCCTAATAGCC 

1510 1530 15S0 

ATCTCCAGGCTGCCTGGTCACTGCCCCTCTCTCAGTGTCAATAGATGAAAGTACATTGGG 

1570 1590 1 610 

AGTCTGTAGG AAACCC AACCTTCTTGTCATTGAAATTTGGCAAAGCTG ACTTTGGG AAAG 
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1630 1650' 1670 

* 

ACGGACCAGAACT7CCCCTCCCTTCCCCTTTTCCCAACCTGGACTTGTTTTAAACTTGCC 

5 

169.0 17.10 . . 17.30 . . 

TGTTCAGAGCACTCATTCCTTCCCACCCCCAGTCCTGTCCTATCACTCTAATTCGGATTT 

10 

1750 1770 1790 

GCCATAGCCTTGAGGTTATGTCCTTTTCCATTAAGTACATGTGCCAGGAAACAGCGACAG 

1810 1830 1850 

ACAGAAAGTAAACGGCAGTAATGCTTCTCCTATTTCTCCAAAGCCTTGTGTGAACTAGCA 

20 

1870 1890 1910 

AAGAGAAGAAAATCAAATATATAACCAATAGTGAAATGCCACAGGTTTGTCCACTGTCAG 

25 

1930 1950 1970 

GGTTGTCTACCTGTAGGATCAGGGTCTAAGCACCTTGGTGCTTAGCTAGAATACCACCTA 

30 

1990 2010 2030 

ATCCTTCTGGCAAGCCTGTCTTCAGAGAACCCACTAGAAGCAACTAGGAAAAATCACTTG 

35 

2050 2070 2090 

CCAAAATCCAAGGCAATTCCTGATGGAAAATGC/\AAAGCACATATA7GTT'rTAATATCTT 

40 

2110 2130 2150 

TATGGGCTCTGTTCAAGGCACTGCTGAGAGGGAGGGGTTATAGCTTCAGGAGGGAACCAG 

45 

21^0 2190 2210 

CTTCTGATAAAqACAATCTGCTAGGAACTTGGGAAAGGAATCAGAGAGCTCCCCTTCAGC 

50 
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i 
\ 



2230 2250 2270 

. • • • • 

GATTATTTAAATTGTTAAAGAATACACAATTTCGCGTATTGGGATTTTTCTCCTTTTC7C 

2290 2310 2330 

T GAG AC ATT G€ A €C A-TT T T AA TT T TTCT AACTGCT T ATT T AT GT GAAAAG.GGX.X ATXT XT--. 

2350 2370 2390 

ACTTAGCTTAGCTA7GTCAGCCAATCCGATTGCCTTAGGTGAAAGAAACCACCGAAATCC 

2410 - 2430 • 2450 

CTCAGGTCCCTTGGTCAGCAGCCTCTCAAGATTTTTTTTGTCAGAGGCTCCAAATAGAAA 

2470 2490 2510 

* • • • • * 

ATAAGAAAAGGTTTTCTTCATTCATGGCTAGAGCTACATTTAACTCAGTTTCTAGGCACC 

2530 2550 2570 

TCAGACCAATCATCAACTACCATTCTATTCCATGTTTGCACCTGTGCATTTTCTGTTTGC 

2590 2610 2630 

CCCCATTCACTTTGTCAGGAAACCTTGGCCTCTGCTAAGGTGTATTTGGTCCTTGAGAAG 

2650 2670 2690 

TGGGAGCACCCTACAGGGACACTATCACTCATGCTGGTGGCATTGTTTACAGCTAGAAAG 

2710 2730 2750 

CTGCACTGGTGCTAATGCCCCTTGGGA;vATGGGGCTGTGAGGAGGAGGATTAT^J\CTTAG 

2770 2790 2010 

GCCTAGCCTCTTTTAACAGCCTCTCAAATTTATCTTTTCTTCTATGGGGTCTATAAATGT 

2830 2850 2870 

4 

ATCTTATAATAAAAAGGAAGGAC AGGAGGAAGACAGGCAAATGTACTTCTC ACCC AGTCT 
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2890 2910 2930 

TCTACACAGATGGAATCTCTTTGGGGCTAAGAGAAAGGTTTTATTCTATATTGCTTACCT 

2950 2970 2990 

GATCTCATGTTAGGCCTAAGAGGCTTTCTCCAGGAGGATTAGCTTGGAGTTCTCTATACT 

70 

3010 3030 3050 

CAGGTACCTCTTTCAGGGTTTTCTAACCCTGACACGGACTGTGCATACTTTCCCTCATCC 

75 

3070 3090 3110 

• • • • ■ » 

ATGCTGTGCTGTGTTATTTAATTTTTCCTGGCTAAGATCATGTCTGA/vTTATGTATGAAA 

20 

3130 3150 3170 

ATTATTCTATGTTTTTATAATAAAAATAATATATCAGACATCGAAAAAAAAAA f 

25 



30 



35 



40 



45 



50 
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(2) 

s 10 30 50 

• • • • 

CAGCCGTGCTCGAAGCGTTCCTGGAGCCCAAGCTCTCCTCCACAGGTGAAGACAGGGCCA 



10 — 



7 0 9 0 110 



GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
Me tGlyHisLeuSe rAlaProLeuHisArgValArgValProTrpGln 

130 150 170 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuLeuThrAlaSe r LeuLeuThrPheTrpAsnProProThrThrAlaGlnLeu 

20 

190 210 230 

..... ^ 

ACTACTGAATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSerrtetProPheAsnValAlaGluGlyLysGluValLeuLeuLeuValHis 

25 

250 270 290 

AATCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGCAAC 
30 AsnLeuProGlnGlnLeuPheGlyTyrSe rTrpTy rLysGlyGluArgValAspGlyAsn 

310 330 350 

35 CGTCAAATTGTAGGATATGCAATAGGAACTCAACAAGCTACCCCAGGGCCCGCAAACAGC 
ArgGlnlleValGlyTyrAlalleGlyThrGlnGlnAlaThrProGlyProAlaAsnSec 

370 390 410 

40 • 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACGTCACCCAGAATGAC 
GlyArgGluThrlleTyrProAsnAlaSerLeuLeuIleGlnAsnValThrGlnAsnAsp 
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430 450 470 

. * • • • • 

"A C A G'G A TTCT A C AC CC T AC7CAGT C A T AAA GTCAGATCTTG TG AA T 6 AA G AAGCA-AC-TGG A. 
ThrGlyPheTycThrLeuGlnVallleLysSerAspLeuValAsnGluGluAlaThrGly 



490 510 530 

. • • • • • 

CAGTTCCATGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
70 GlnPheHisValTyrPro'GluLeuProLysProSerlleSerSerAsnAsnSerAsnPro 



550 570 590 

• • • • • • 

15 GTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
ValGluAspLysAspAlaValAlaPheThrCysGluProGluThrGlnAspThrThrTy r 

610 630 650 

20 * ' ' 

CTGTGGTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
LeuTrpTrpI leAsnAsnGlnSe rLeuProValSe rProArgLeuGlnLeuSe rAsnGly 



25 670 690 710 

AACAGGACCCTCACTCTACTCAGTGTCACAAGGAATGACACAGGACCCTATGAGTGTGAA 
AsnArgThrLejThrLeuLeuSe rValThrArgAsnAspThrGlyProTy rGluCysGlu 



730 750 770 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
IleGlnAsnProValSerAlaAsnArgSe rAspProValThr LeuAsnValTh rTy rGly 



790 810 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
ProAspThr ProTh r 1 1 eSe rP roSe rAspThrTy rTy rArgProGlyAlaAsnLeuSe r 

40 



45 
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(' 



8S0 870 890 

. • • • • 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACA'GTACTCCTGGCTTATCAATGGAACA 
Leu S e rry STy rA IsKlsS e r A s n P r o P r o A-l aGl n Ty rSer-Tf pLeu-IieA^vGiy Th r 



910 930 950 

. . * • • • 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTGGATCC 
PheGlnGlnSerThrGlnGluLeuPhelleProAsnlleThrValAsnAsnSerGlySer 



970 990 1010 

TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
TyrThrCysHisAlaAsnAsnSe rValThrGlyCysAsnArgThrThrValLysThr lie 



1030 1050 1070 

ATAGTCACTGAGCTAAGTCCAGTAGTAGCAAAGCCCCAAATCAAAGCCAGCAAGACCACA 
IleValThrGluLeuSe rP roValValAlaLy sP roGlnl leLysAlaSe r Ly sTh rTh r 



1090 1110 1130 

GTCACAGGAGATAAGGACTCTGTGAACCTGACCTGCTCCACAAATGACACTGGAATCTCC 
ValThrGlyAspLy sAspSe rValAsnLeuThrCysSe rThrAsnAspThrGlylleSe r 



1150 1170 1190 

• • > * • » 

ATCCGTTGGTTCTTCAAAAACCAGAGTCTCCCGTCCTCGGAGAGGATGAAGCTGTCCCAG 
IleArgTrpPhePheLysAsnGlnSerLeuProSerSerGluArgMetLysLeuSerGln 



1210 1230 1250 

» • • • » • 

GGCAACACCACCCTCAGCATAAACCCTGTCAAGAGGGAGGATGCTGGGACGTATTGGTGT 
GlyAsnThrThrLeuSe rlleAsnPcoValLysArgGluAspAlaGlyThrTy rTrpCys 
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1270 1290 1310 

• - » ► • ■ 

GAGGTCTTC^CCCAATCAGTAAG^CCAAAGCGACCCCATCATGCTGAACGTAAACTAT 
GluValPheAsnProl leSe rLysAsnGlnSe rAspProIlerte tLeuAsnValAsnTy r 

1330 1350 1370 

• • • • • • 
AATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGGCCATTGCTGGCATTGTGATTGGA 

70 AsnAlaLeuProGlnGltfAsnGlyLeuSecProGlyAlalleAlaGlylleVallleGly 

1390 1410 1430 

• • • • » 

75 GTAGTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTGCATTTCGGGAAG 
ValValAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeuHisPheGlyLys 

1450 1470 1490 

2 0 . . . . .■ 

ACCGGCAGCTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGAAGTTACTTATTC 
ThrGlySerSe rGlyProLeuGln 

25 1510 1530 1550 

TACCCTGAACTTTGAAGCCCAGCAACCCACACAACCAACTTCAGCCTCCCCATCCCTAAC 

30 1570 1590 1610 

AGCCACAGAAATAATTTATTCAGAAGTAAAAAAGCAGTAATGAAACCTGAAAA^AAAAAAA 

35 1630 
AAAAAAAAAA 
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(3) 



5 

10 30 50 

CAGCCGTGC7CGAAGCGTTCCTGGACCCCAACCTCTCCTCCACAGGTGAAGACAGGGCCA 

10 

70 90 110 

GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGACTGCGTGTACCCTGGCAG 
75 MetGlyHisLeuSerAlaProLeuHisArgValAcgValProTrpGln 

130 150 170 

on GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCC AGCTC 

20 

GlyLeuLeuLeuThrAlaSe r Le uLe uTh r PheTrpAs nP roPcoThr Th rAlaGlnLeu 

190 210 230 

25 ACTACTGAATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSe rMe tProPheAsnValAlaGluClyLysGluValLeuLeuLeuValHi s 



250 270 290 

30 . . . ... 

AATCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGC/^.C 
AsnLeuP roGlnGlnLeuPheGlyTy rSe rTrpTy r LysGlyGluArgVa lAspGlyAsn 



35 31 0 330 3 50 

CGTCAJ^TTGTAGGATATGCAATAGGAACTCAACAAGCTACCCCAGGGCCCGC/^CAGC 
ArgGlnl leValGlyTy rAlal leGlyThcGlnGlnAlaThrProClyProAlaAsnSe r 

40 

370 390 U0 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACGTC ACCCAGAA.TGAC 
GlyArgGluThr I leTy rProAsnAlaSe rLeuLeuIleGlnAsnValThrGlnAsnAsp 

45 , 

430 450 470 

ACAGGATTCTACACCCTACAA.GTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGA 
50 ThrGlyPheTyrThrLeuGlnVallleLysSe rAspLeuVal AsnGluGluAl aTh rGl y 
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490 510 530 

. > * • • . • 

.CA.GTT£C&JGTA?AC.C_C.GGA^ 

GlnPheHi sValTy rProGluLeuProLysProiSe r 1 1 e S e~r S e c A s'KX s ri5~e rXsnPTo 



550 570 590 

• • • • • • 

GTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
ValGluAspLysAspAlaValAlaPheThrCysGluProGluThrGlnAspThrThrTy c 



610 630 650 

• ■ • • • • 

CTGTGGTGGATAAACAATCAGAGCCTCCCCGTCAGTCCCAGGCTGCAGCTGTCCAATGCC 
LeuTrpTrpIleAsnAsnGlnSe rLeuProValSe rProArgLeuGlnLeuSe rAsnGly 



670 690 710 

* • • * • • 

AACAGGACCCTCACTCTACTCACTGTCACAACCAATGACACAGGACCCTATGACTGTGAA 
AsnArgThr LeuThrLeuLeuSe r Va lTh rAr gAsnAspTh rGlyP roTy rGluCysGlu 



730 750 770 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
IleGlnAsnProValSerAlaAsnArgSerAspProValThrLeuAsnValThrTyrGly 



790 810 830 

CCGGACACCCCCACCATTTCCGCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
? r oA*rpTh r ProThri l-rS^r-P^-oS^ir-A'&p-Th^-Tyr -Ty-c Ac-g-P-toCl yAl a As n Le uS^-C 



850 870 e90 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATCAATGGAACA 
LeuSe rCysTyrAlaAlaSe r AsnP roProAlaGlnTy rSe rTrpLeuI leAsnGlyTh r 



910 930 950 

* • • • • 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAG TGGATCC 
PhcGlnGlnSe cTh r G 1 nGluLeuPhe I le P coAsnl 1 eTh r Va 1 As nAsnS e rGlySe c 



970 990 1010 

TATACCTGCCACGCC AATAACTC AGTC ACTCGCTGC AAC AGGACCAC AGTC AAGACGATC 
TyrThrCysHt s Ala AsnAsnSe rValThrGlyCy s AsnArgThr ThrValLy sThr He 



( EP 0 346 710 B1 ( 

1030 1050 1070 

ATAGTCACTGATAATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGGCCATTGCTGGC 
1 1 e va 1 Th r As pAs nAl aLeu P r oGl nGluAsnGly Le uSe r P r oGl yAl a IleAlaGly 

5 

1090 1110 1130 

ATTCTGATTGGAGTACTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTG 
10 lleVallleGlyValvalAlaLeuValAlaLeuIleAlaValAl aLeuAl aCy s Phe Leu 

1150 1170 1190 

75 CATTTCGGGAAGACCGGCAGCTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGA 
Hi sPheGlyLysThrGlySerSe cGlyProLeuGln 

1210 1230 1250 

AGTTACTTATTCTACCCTGAACTTTGAAGCCCAGCAACCCACACAACCAACT7CAGCCTC 

1270 1290 1310 

25 ...... 

CCCATCCCTAACAGCCACAGAAATAATTTATTCAGAAGTAAAAAAGCAGTAATCAAACCT 
1330 

30 

GAAAAAAAAAAAAAAAAAA 
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(4) 



1 acagcacagctgacagccgtactcaggaagcttctggatcctaggcttatctccacagag 60 

5 

61 gagaacacacaagcagcagagaccatggggcccctctcagcccctccctgcacacacctc 120 

MetGlyProLeuSerAlaProProCysThrHi sLeu 

L21 atcacttgga^^gg tc^t^tc«A9ca-tca^ttttaa-ac-ttctg9aa.tcc^cccaca 1.80 . 

IleThrTrpLysGlyValLeuLeuThrAlaSerLeuLeuAsnPheTrpAsnProProThr 

10 

181 actgcccaagtcacgattgaagcccagccacccaaagtttctgaggggaaggatgttc tt 24 0 
ThrAlaGlnValThrlleGluAlaGlnProProLysValSerGluGlyLysAspValLeu 

241 ctacttgtccacaatttgccccagaatcttgctggctacatttggtacaaagggcaaatg 300 
LeuLeuValHisAsnLeuProGlnAsnLeuAlaGlyTyrlleTrpTyrLysGlyGlnMe t 

75 

301 acatacgtctaccattacattacatcatatgtagtagacgg.tcaaagaattatata tggg 360 
ThrTyrValTyrHisTyrlleThrSerTyrValValAspGlyGlnArgilelletyrGly 

361 cctgcatacagtggaagagaaagagtatattccaatgcatccctgctgatccagaatgtc 4 20 
ProAlaTyrSerGlyArgGluArgValTyrSerAsnAlaSerLeuLeuIleGlnAsriVal 

20 

.4 21. acgcaggaggatgcaggatcctacaccttacacatcataaagcgacgcgatgggactgga 480 
ThrGlnGluAspAlaGlySerTyrThrLeuHisllelleLysArgArgAspGlyThrGry 

4 81 ggagtaactggacatttcaccttcaccttacacctggagactcccaagccctcca tctcc 54 0 
GlyValThrGlyKisPheThrPheThrLeuHisLeuGluThrProLysProSerlleSer 

25 

541 agcagcaacttaaatcccagggaggccatggaggctgtgatcttaacctgtga tec tgcg 600 
SerSerAsnLeuAsnProArgGluAlaMetGluAlaVallleLeuThrCysAspProAla 

601 actccagccgcaagctaccagtggtggatgaatggtcagagcctccctatgactcacacg 66 0 
ThrProAlaAlaSerTyrGlnTrpTrpMetAsnGlyGlnSerLeuProMetThrHisArg 

30 

6 61 ttgcagctgtccaaaaccaacaggaccctctttatatttggtgtcacaaagtatattgca 72 0 
LeuGlnLeuSerLysThrAsnArgThrLeuPhellePheGlyValThrLysTyrlleAla 

f2l ggaccctatgaatgtgaaatacggaacccagtgagtgccagccgcagtgacccagtcacc 780 
GlyProTyrGluCysGluIleArgAsnProValSerAlaSerAcgSerAspProValThr 

35 

781 ctgaatctcctcccaaagctgtccaagccctacatcacaatcaacaacttaaaccccaga 84 0 
LeuAsnLeuLeuProLysLeuSerLysProTyrlleThrlleAsnAsnLeuAsnProArg 

8 41 gagaa taagga t g tc t ta a cc t tcacctgtgaacctaa gag tgagaac tacacct a catt 900 

GluAsnLysAspValLeuThrPheThrCysGluProLysSerGluAsnTyrThrTy rl le 

40 

9 01 tggtggctaaatggtcagagcctccctgtcagtcccagggtaaagcgacccattgaaaac 96 0 

TrpTrpLeuAsnGlyGlnSerLeuProValSerProArgValLysArgPcoIleGluAsn 

961 agga tec tea ttctacccaa tg tcacgagaaa tgaaacaggacc tta tea a tgtgaaata 1020 
ArglleLeuI leLeuP roAsnValThrArgAsnGluTh rGlyProTy rGlnCysGluI le 

45 10 21 egggaccgata tggtggcatccgcagtgacccagtcaccctgaatgtcctctatgg tcca 108 0 
ArgAspArgTyrGlyGlylleArgSerAspProValThrLeuAsnValLeuTyrGlyPro 
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1081 gacctccccagcatttacccttcattcacctattaccgttcaggagaaaacctctacttt 1140 
AspLeuProSerlleTyrProSerPheThrTyrTyrArgSerGlyGluAsnLeuTyrPhe 

1141 tcctgcttcggtgagtctaacccacgggcacaatattcttggacaattaatgggaagttt 120 0 
SerCysPheGlyGluSerAsnProArgAlaGlnTyrSerTrpThrlleAsnGlyLysPhe 

-1201 cagctatcaggacaaaagctctctatcceccaaataactacaaagcatagtgggctctat 1260 
GlnLeuSerGlyGlnLysLeuSerlleProGlnlleThrThrLysHisSerGlyLeuTyr 

1261 gcttgctctgttcgtaactcagccactggcaaggaaagctccaaatccatcacagtcaaa 132 0 
AlaCysSerValArgAsnSerAlaThrGlyLysGluSerSerLysSerlleThrValLys 

1321 gtctctgactggatattaccctgaattctactagttcctccaattccattttctcccatg 138 0 
ValSerAspTrpIleLeuProEnd 



J>£81 gaatcacgaagagcaagacccactctgttccagaagccctataatctggaggtggacaac 1440 

X4 41 tcgatgtaaatttcatgggaaaacccttgtacctgacatgtgagccactcagaactcacc 1500 

1501 aaaatg ttcgacaccataacaacagc tactcaaactgtaaaccagga taagaag ttga tg 15 6 0 

1561 acttcacactg tggacag tttttcaaagatgtcataacaagactccccatcatgacaagg 1620 

1621 c tccaccctc tactgtctgctcatgcctgcctctttcacttggcaggataatgcagtcat 1680 

1681 tagaatttcacatgtagtagcttctgagggtaacaacagagtgtcagatatgtcatctca 174 0 

1741 acctcaaact tttacgtaacatctcagggaaatgtggc tctctccatc ttgca tacaggg 1800 

1801 ctcccaa tagaaatgaacacagagatattgcctgtgtgtttgcagagaagatggtttc ta 18 60 

1861 taaagag taggaaagc tgaaa ttatagtagagtctcc t t taaatgcaca ttgtg tgga tg 192 0 

1921 gctctcaccatt tcctaagagatacag tgtaaasiacgtgacagtaa tactga ttctagca 198 0 

1981 gaataaacatgtaccacatttgcaaaaaa ~ 2010 
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(5) 

1 gggtggatcctaggctcatctccataggggagaacacacatacagcagagaccatggga 59 
5 MetGly 

60 cccctctcagcccctccctgcactcagcacatcacctggaaggggctcctgctcacagca 119 
ProLeuSerAlaProProCysThrGlnHisIleThrTrpLysGlyLeuLeuLeuThrAla 

120 tcacttttaaacttctggaacctgcccaccactgcccaagtaataattgaagcccagcca 179 
10 SerLeuLeuAsnPheTrpAsnLeuProThrThrAlaGlnValllelleGluAlaGlnPro 

180 cccaaagtttctgaggggaaggatgttcttctacttgtccacaatttgccccagaatctt 239 
ProLysValSerGluGlyLysAspValLeuLeuLeuValHisAsnLeuProGlnAsnLeu 

24 0 actggctacatctggtacaaagggcaaatgacggacctctaccattacattacatcatat 299 
75 ThrGlyTyrlleTrpTyrLysGlyGlnMetThrAspLeuTyrHisTyrlleThrSerTyr 

^300 gtagtagacggtcaaattatatatgggcctgcctacagtggacgagaaacagtatattcc 359 
ValValAspGlyGlnllelleTyrGlyProAlaTyrSerGlyArgGluThrValTyrSer 

360 aatgcatccctgctgatccagaatgtcacacaggaggatgcaggatcctacaccttacac 419 
20 AsnAlaSerLeuLeuIleGlnAsnValThrGlnGluAspAlaGlySerTyrThrLeuHis 

420 atcataaagcgaggcgatgggactggaggagtaactggatatttcactgtcaccttatac 479 
IlelleLysArgGlyAspGlyThrGlyGlyValThrGlyTyrPheThrValThrLeuTyr 

480 tcggagactcccaagcgctccatctccagcagcaacttaaaccccagggaggtcatggag 539 
25 SerGluThrProLysArgSerlleSerSerSerAsnLeuAsnProArgGluValMetGlu 

540 gctgtgcgcttaatctgtgatcctgagactccggatgcaagctacctgtggttgctgaat 599 
AlaValArgLeulleCysAspProGluThrProAspAlaSerTyrLeuTrpLeuLeuAsn 

600 ggtcagaacctccctatgactcacaggttgcagctgtccaaaaccaacaggaccctctat 659 
30 GlyGlnAsnLeuProMetThrHisArgLeuGlnLeuSerLysThrAsnArgThrLeuTyr 

660 ctatttggtgtcacaaagtatattgcagggccctatgaatgtgaaatacggaggggagtg 719 
j LeuPheGlyValThrLysTyrlleAlaGlyProTyrGluCysGluIleArgArgGlyVal 

720 agtgccagccgcagtgacccagtcaccctgaatctcctcccgaagctgcccatgccttac 779 
35 SerAlaSerArgSerAspProValThrLeuAsnLeuLeuProLysLeuProMetProTyr 

780 at caeca tcaacaacttaaaccccagggagaagaaggatgtgttagccttcacctgtgaa 839 
IleThrlleAsnAsnLeuAsnProArgGluLysLysAspValLeuAlaPheThrCysGlu 

84 0 cctaagagtcggaactacacctacatttggtggctaaatggtcagagcctcccggtcagt 899 
40 P^oLysSerArgAsnTyrThrTyrlleTrpTrpLeuAsnGlyGlnSerLeuProValSer 

900 ccgagggtaaagcgacccattgaaaacaggatactcattctacccagtgtcacgagaaat 959 
ProArgValLysArgProIleGluAsnArglleLeuIleLeuProSerValThrArgAsn 

960 gaaacaggaccctatcaatgtgaaatacgggaccgatatggtggcatccgcagtaaccca 1019 
45 GluThrGlyProTyrGlnCysGlulleArgAspArgTyrGlyGlylleArgSerAsnPro 
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1020 g tcaccc tgaa tg tec tc ta tgg tccagacctccccagaatttaccct tact tea cc tat 1079 
ValThrLeuAsnValLeuTyrGlyProAspLeuProArglleTyrProTyrPheThrTyr 

1080 taccgttcaggagaaaacctcgacttgtcctgctttgcggactctaacccaccggcagag 1139 
5 TyrArgSerGlyGluAsnLeuAspLeuSerCysPheAlaAspSerAsnProProAlaGlu 

114 0 tatttttggacaattaatgggaagtttcagctatcaggacaaaagctctttatcccccaa 1199 
TyrPheTrpThrlleAsnGlyLysPheGlnLeuSerGlyGlnLysLeuPhelleProGln 

1200 attactacaaatcatagcgggctctatgcttgctctgttcgtaactcagccactggcaag 1259 
10 IleThrThrAsnHisSerGlyLeuTyrAlaCysSerValArgAsnSerAlaThrGlyLys 

1260 gaaatctccaaatccatgatagtcaaagtctctggtccctgccatggaaaccagacagag 1319 
GluIleSerLysSerMetlleValLysValSerGlyProCysHisGlyAsnGlnThrGlu 

1320 tctcattaatggctgccacaatagagacactgagaaaaagaacaggttgataccttcatg 1379 

*5 SerHisEnd 

T380 aaattcaagacaaagaagaaaaaggctcaatgttattggactaaataetcaaaaggataa 14 39 

14 40 tgttttcataatttttattggaaaatgtgctgattcttggaatgtttt r attctccagatt 14 99 

1500 tatgaactttttttcttcagcaattggtaaagtatacttttgtaaacaaaaattgaaaca 1559 

1560 tttgcttttgctctctatctgagtgccccccc 1591 

20 



2. A replicable recombinant cloning vehicle having an insert comprising a nucleic acid of claim 1 . 
25 3. A cell that is transfected, infected or injected with a recombinant cloning vehicle of claim 2. 

4. A method for preparing a polypeptide, said method comprising the steps of 

(a) culturing the cell of claim 3 

(b) recovering the polypeptide expressed by said cell. 

30 

5. A method for preparing an antibody directed against a polypeptide said method comprising the steps of 

(a) preparing said polypeptide by the method of claim 4 

(b) injecting said polypeptide into a host capable of producing antibodies and 

(c) recovering said antibodies. 

35 

Patentanspruche 

1. Nucleinsaure, umfassend eine Basen-Sequenz, die fur eine Peptid-Sequenz codiert, 

dadurch gekennzeichnet, daS die Gruppen-Nucleinsaure eine DNA ist, die aus der folgenden Gruppe 
40 von ftinf Sequenzen ausgewahlt ist: 
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10 30 SO 

• ♦ • * 

CAGCCG7GC7CGAAGCGTTCCTGGAGCCCAAGCTCTCC7CCACAGGTGAAGACAGGGCCA 
70 90 110 

» • • • • . 

GCACCAGACACCA7GGGCCACC7C7CAGCCCCAC77CACAGACTGCC7G7ACCC7GGCAC 
ne iClyKi sLeuSe rAl&ProLeuHi sAr gValAr yval?r oTrpGln 

120 150 1/0 

CGCC77C7GC7CACACCCTCAC77CTA^CC77CTGGA^XCCCCCCACCACTGCCCAGC" C 
G 1 y * v e v *- s u T h : A i & S e r L t u L e u T h r ? h * 7 r p A. s n ? r o ? r c 7 h r T r A 1 *. G 1 «M -2 - 

150 210 230 

ACT A C 70 AA7 C C A 7G C C ATTC A-A 7G 7TGC AC AG G G G AAG G A G G 7 7 C 7 7 C 7 C C T7C 7 C C A I 
20 7hr 7h:GluSe rre t?ro?heAsnVeiAl cGluGlyLysGluVal Lc wis uLe uVeIh i s 

2id 210 290 



70 



75 



25 



.^A7C7GCCCCAGCAAC77777GGC7ACAGC?GG7ACA^AGGGGAAAGAG7GGA7GC-CAAC 
AsnL»u?:cClriG2 nLsurheCl yTy zZe r?:c7y rLysGlyGi uAr 5 V* 1 AipGl yA£r. 

310 330 350 • 

CC7CAJ^77G7AGGA7A7GCAA7AGGA-ACTC;^ACA^CC7AC 
30 ArcGlM 1 • v a 1 Gl yTy r Al e • 1 e Gl yTh i CI r.GlnAlaTh : ? : cCl y ? r oa ItAsnSer 

370 3r0 <10 

35 GG7CGAGACACA^7A7ACCCCAATGCATCCCTGC7CATCCAGAA.CCTCACCCAGA.ATGAC 
Gl yAr 5GI u7hr I 1 e7y r P r oAsnAl a Se r LeuLtul 1 cGl nAsnVa 1 Th r Gl nA t nAsp 

00 «S0 «"70 

40 AC AGG ATTCTAC ACCCTACAAGTCA7AAAGTC AC ATCTTGTGAA7GAAC AACC AACTGGA 
7hf GlyPhcTy tThrLeuGlnval 1 1 ctysSc r AspLeuVe 1 AsnGl uGl uAl aTh r Gi y 
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;9C 510 ^30 

CAG7TCCATCTATACCCGCAGCTGCCC^\GCCCTCCATCTCCAGCAAC^ACTCCAACCCT 

GinPheKi sv» l~y : ?roGluLeu?r oLys?roSe r 1 1 e Se rSe r AsnAsnSe rAsn?r o 

5 

55: 570 590 

G7GGAGGAC.---.GGA7GC7G7GCCC7TCACC7C7GAACC7GAGAC7CACGACACAACC7AC 
10 valGluAsy'.ysAspAlaValAlaPheThrCysCluProGluThrGlnAspThrThrTyr 

6 i C 630 650 

75 CTGTGGTGGATA^nC.A-ATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCC.kATCGC 
LeuTr pTrp: 1 * AsnAsnGlnSe rLeuProValSerProArgLeuGlnLeuSerAsnGly 

S7v 690 710 

20 A-ACAGGACCCTC ACTCTACTCAGTGTCACAAGGAATGACACAGGAC CCTATGACTGTGA.A 
AsnArcTh:L;cThrleuleuSe r ValTh rAr gAcnAspTh rCl y? r oTy rGlcCysGlu 



25 



30 



HZ 730 770 

A7AC AG.--ACCCAGTGAGTGCGAACCGCAGTGA.CCCAGTCACCTTGAATGTCACCTATGGC 
! ! eGl r. As r.? : cv* 1 Se r Ala As nA rcSe t As??r oVaiThr LeuAsnValThrTy rGl y 

7 S : 810 8 3 0 

C C GG A C AC C C Z C A C C ATTTC C C C T7C AGACAC C 7A77AC CG7C C AG GGGC AA~AC C 7C AGC 
?rcAs?7*"?:o?hr I leSe rProSc rAspThr7y rTy rArg? r oGlyAlaAsnLeuSe r 

35 35: 870 aso 

• » • • • 

C7C7CC7GCTA7GCAGCC7CTA^CCCACCTGCACAC7AC?CC7GGC7TA7C.A.ATCGA^CA 
LeuSe rCysTy rAiaAlaSc rAsn?ro?roAlaGln7yrSe r7rpLeuI i eAsnGly7hr 
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910 930 950 

• « ■ 

7TCCAGC^--.GCACACXAGAGC7C777A7CCC7A.ACA7CAC7G7GAA7A^7AG7GGA7CC 
PheClnClnSe r 7h r Gl nGl uLc u?he 1 1 c P roAsn I i eT-h r Va 1 AsnAsnS e i GlySe r 



970 990 1010 

• • * - » 

TATACCTGCCACCCCKATAACTCAGTCACTCGCTGCAACAGCACCACAGTCAACACCATC 
TyfThrCysHisAUAsnAsnSerVdlThrClyCysAsnAcgThtThrVdlLysT^r lie 
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1020 1050 1070 

A7AG7CAC7GA7AATGCTC7ACCACAACAAAA7CGCCTC7CACC7GGGGCCA77GCTGGC 
1 1 e valTh rAspAsnAl a Leu? r oGlnGluAsnCl yLcuSe r ? r oGlyAl i 1 1 eAl aCly 

5 

1090 1110 1130 



10 



A77G7GA77CG AG7AG7GGCCCTGG77GCTC7GA7AGC AG7AGCC C7GGCA7G7777C 7G 
1 level 1 1 eGlyvelVdlAlftLcuVdlAlaLeullcAlfeV&lAliLcuAlsCysPheLeu 

us: mo 1190 

CA777CGGGAAGACCGGCAGGCCAAGCGACCAGCG7GATCTCACAGAGCACAAACCC7CA 
75 Hi s?hcGlyLysThrClyArgAlaScrAspGlnArgAspLeuThrGluHisLys?ro5e r 

1210 1230 1250 

20 G7C7CC AACCACAC7CACGACCAC7CCAA7GACCCACC7AACAAGA7GAA7GAAGTTACT 

V4 1 Se r As nHi sThrGlnAspHi sSe lAsnAsp? r o? r oAsnty stte t AsnGluVd 1 T. w . r 

■ 

1270 1290 1310 

25 TATTCTACCC?CAAC??TCAACCCCACCA^.CCCACACXACCAACrTCACCC?CCCCATCC 
Ty r Se r T'n r LeuAsnPheCluAi eClnCln? r oTh tCln? r oTh r Se rAleSer?roSer 

1 3 3 3 13 SO 1310 

30 • • • ■ • 

C7AACAGCCACAGAAATAA7T7A7TCAGAAG7>>>^AAGCAG7A^7GAAACC?G7CC7GC 
Leu7hrAla7hrGluIle2ie7yrSerCluV&lLysLysGLn 

35 1350 1410 K30 

7CAC7GCAG7GC7GA7G7A77TCAAG7C7C7CACCC7CA7CAC7AGCAGA77CCT77CCC 

1450 1470 1490 

40 

C7G7AGGG7AGAGGGG7GGGGACAGAAACAAC777C7CC7AC7C77CC77CC7AA7AGGC 
1510 1530 15S0 

45 

A7C7CCAGGC7CCCTGGTCACTGCCCC7C 7CTCAC TGTCAA TAG A7GAAAGTAC A 77GCG 
t 

1570 1590 1610 

50 * " ' ' " 

AGrCTGTAGGAAACCCAACCT7CTTG7CA7TGAAATTTGGCAAACCTGACTTrGGGAAAG 
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1630 1630 1610 

ACCGACCACAAC77CCCC7CCC77CCCCTT77CCCAACC7GCAC7TC7T77AAAC T7CCC 

5 

16S0 . i7i0 1730 

* 

7C77CAGAGCA.C7CA77CC7TCCCACCCCCAGTCC7GTCCTA7CAC7C7AA77CGGA777 

w 

1150 1770 1790 

GCCA7AGCC7;GAGG77A7G7CCT777CCAT7AJ<CTACA7G7GCCAGGAAACAGCGA.GAG 

75 

1810 1B30 1850 

* * • • 

AGACAAAGTA.~-nCGGCACTAATGCTTCTCC7'ATTTC7CCAAAGCCTTGTGTGAACTAGC A 

20 

1670 1890 - 191CT 

AAGAGA^GAAAATC.A-^ATATATAACC/^TAGTCAAATGCCACAGGTTTGTCCACTGTCAG 

1930 1950 1970 

GGTTGTCTACCTGTAGGATCAGGGTCT/^GCACCTTGGTGCTTAGCTAGAATACCACCTA 

30 19S0 2010 2030 

A7CCT7C7GGC.-^.GCC7G7C77CAGAGAACCCACTAGAAGCAACTAGGA>.AA^7CAC77G 

35 2050 2070 2090 

CCAAAA7CCAAGCCAATTCCTGATGGA>AATGCAAAAGCACATATA7G7TTTAATA7CT7 

40 21 10 21 30 2150 

7A7GGGC7C7C77CAAGGC AG7GC7GAGAGGGAGGGGT7A7AGCT7CAGGAGGGAA.CC AG 

45 2 1 70 21 90 2 2 1 0 

C7TC7CA7AAAqACAA7C rGC7AGGAACT7GCCAAAGCAATCAGACAGCTGCCCTTCAGC 
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2 2 3 C 22SO 2 2 7 0 

GA77A777AAA77G77AAAGAA7ACACAATT7GGCG7A77GGGA77777C7CCT77TC7C 

5 

2 2 9 0 2 310 2 3 3 0 

7GAGACA77CCACCA77 7 7.-A 7T TT7G T AAC TG C TT A T TT A 7G TCAAAAC G G7 7 A77777 

10 

2 3 S : 2 37 0 2 39 0 

AC77AGC77AGC7A7G7CAGCCAA7CCGATTGCC77ACG7CAAAGAAACCACCCAAA7CC 

75 

2<10 2430 * 2450 

CTCAGG7CCC77CC7CACGAGCC7CTCAAGATT77TTTTGTCAGAGCCTCCAJ^7AGAAA 

20 

2470 2490 2510 

* • 

A7AAGAwAAAGG7777C77CA77CA7GCC7AGAGCTAGATT7AACTCAG7T7CTAGGCACC 

25 2 530 2550 2570 

7CAGACCAJV7CA7CAAC7ACCAT7CTATTCCATGTTTGCACCTGTGCATT7TCTGTT7GC 

so 2 59 : 2610 2630 

* 

CCCCA77CAC777G7CAGG.AAACCTTGGCCTC7GCTA^GGTGTATTTGGTCCT7GAGAAG 
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2650 2670 2S9C 

■ • • • . 

7GGGAGCACCC7ACAGGGACAC7ATCACTCATGCTGGTGGCATTGTTTACAGCTAG^V.G 

2710 2730 2750 

C7GCAC?GG7GC7A_A7GCCCCTTGGGAA/\7GGGGC7GTGAGGAGGAGGAT?A7AJiC?TAG 

2770 2790 2010 

G C C T AG C C 7 CT777 A.-. C AG C C T C TG AAA T7T ATC TTTT C TT C T A 7CGGG7CTA T AA A 7 G T 

2830 2050 2870 

* 

ATCT 7 A TAA T AAAAAGGAAGG AC AGGACGAACAC ACGC AAATGTAC TTC7C AC C C AG7C T 
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2590 2S10 2930 

7C7AC AC AGA7GGAA7C7C777GGGGC7AAGAGAAAGG7777A7TC7ATATTGC7TAC C T 

5 

2S50 2970 2990 

GA7C7CA7G77AGCCC7.\AGAGGC777CTCCAGGAGGA77AGC7TGGAG77C7C7ATAC7 

10 

3010 3030 3050 

CAGGTACCTCTTTCACGG7TTTCTAACCCTGACACCGACTGTGCATACTTTCCCTCATCC 

75 

3 07 0 3090 31 10 

ATGCTGTGCTGTGTTATTTAATTTTTCCTGGCTAACATCATGTCTGA/vTTATGTATGAAA 

20 

3130 3150 3170 

A7TA77C7A7GT7777A7AA7AAAAA7AA7A7A7CAGACATCCAAAAAAAAAA , 

25 
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